SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: Choi et al.

(ii) TITLE OF INVENTION: Streptococcus pneumoniae Antigens and Vaccines

(iii) NUMBER OF SEQUENCES: 454 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: Dell Latitude C610 

(C) OPERATING SYSTEM: Windows 2000

(D) SOFTWARE: ASCII Text 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/765,272

(B) FILING DATE: January 22, 2001 

(C) CLASSIFICATION:



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/961,083 

(B) FILING DATE: OCT-30-1997 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Lin J. Hymel 

(B) REGISTRATION NUMBER: 45,414 

(C) REFERENCE/DOCKET NUMBER: PB340P2C2



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 610-5790 

(B) TELEFAX: (301) 309-8439 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TAAAATCTAC GACAATAAAA ATCAACTCAT TGCTGACTTG GGTTCTGAAC GCCGCGTCAA 60

TGCCCAAGCT AATGATATTC CCACAGATTT GGTTAAGGCA ATCGTTTCTA TCGAAGACCA 120

TCGCTTCTTC GACCACAGGG GGATTGATAC CATCCGTATC CTGGGAGCTT TCTTGCGCAA 180

TCTGCAAAGC AATTCCCTCC AAGGTGGATC AACTCTCACC CAACAGTTGA TTAAGTTGAC 240

TTACTTTTCA ACTTCGACTT CCGACCAGAC TATTTCTCGT AAGGCTCAGG AAGCTTGGTT 300

AGCGATTCAG TTAGAACAAA AAGCAACCAA GCAAGAAATC TTGACCTACT ATATAAATAA 360

GGTCTACATG TCTAATGGGA ACTATGGAAT GCAGACAGCA GCTCAAAACT ACTATGGTAA 420

AGACCTCAAT AATTTAAGTT TACCTCAGTT AGCCTTGCTG GCTGGAATGC CTCAGGCACC 480

AAACCAATAT GACCCCTATT CACATCCAGA AGCAGCCCAA GACCGCCGAA ACTTGGTCTT 540

ATCTGAAATG AAAAATCAAG GCTACATCTC TGCTGAACAG TATGAGAAAG CAGTCAATAC 600

ACCAATTACT GATGGACTAC AAAGTCTCAA ATCAGCAAGT AATTACCCTG CTTACATGGA 660

TAATTACCTC AAGGAAGTCA TCAATCAAGT TGAAGAAGAA ACAGGCTATA ACCTACTCAC 720

AACTGGGATG GATGTCTACA CAAATGTAGA CCAAGAAGCT CAAAAACATC TGTGGGATAT 780

TTACAATACA GACGAATACG TTGCCTATCC AGACGATGAA TTGCAAGTCG CTTCTACCAT 840

TGTTGATGTT TCTAACGGTA AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA 900

TGTTTCCTTC GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 960

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA CTGCTACTAT 1020

CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACT CCTGTTTATA ACTGGGATAG 1080

GGGCTACTTT GGCAACATCA CCTTGCAATA CGCCCTGCAA CAATCGCGAA ACGTCCCAGC 1140

CGTGGAAACT CTAAACAAGG TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTAGG 1200

AATCGACTAC CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 1260

CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT TTGCAAATGG 1320

TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC TTTAGTGATG GGAGTGAAAA 1380

AGAGTTCTCT AATGTCGGAA CTCGTGCCAT GAAGGAAACG ACAGCCTATA TGATGACCGA 1440

CATGATGAAA ACAGTCTTGA CTTATGGAAC TGGACGAAAT GCCTATCTTG CTTGGCTCCC 1500

TCAGGCTGGT AAAACAGGAA CCTCTAACTA TACAGACGAG GAAATTGAAA ACCACATCAA 1560

GACCTCTCAA TTTGTAGCAC CTGATGAACT ATTTGCTGGC TATACGCGTA AATATTCAAT 1620

GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT GTAGGCAATG GCCTTACGGT 1680

CGCTGCCAAA GTTTACCGCT CTATGATGAC CTACCTGTCT GAAGGAAGCA ATCCAGAAGA 1740

TTGGAATATA CCAGAGGGGC TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1800

TTCTACGTGG AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1860

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA ATAGTACGAC 1920

TACCAATCCT AACAATAATA CGCAACAATC AAATACAACC CCTGATCAAC AAAATCAGAA 1980

TCCTCAACCA GCACAACCA 1999
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 666 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Lys Ile Tyr Asp Asn Lys Asn Gln Leu Ile Ala Asp Leu Gly Ser Glu
1 5 10 15

Arg Arg Val Asn Ala Gln Ala Asn Asp Ile Pro Thr Asp Leu Val Lys
20 25 30

Ala Ile Val Ser Ile Glu Asp His Arg Phe Phe Asp His Arg Gly Ile
35 40 45

Asp Thr Ile Arg Ile Leu Gly Ala Phe Leu Arg Asn Leu Gln Ser Asn
50 55 60

Ser Leu Gln Gly Gly Ser Thr Leu Thr Gln Gln Leu Ile Lys Leu Thr
65 70 75 80

Tyr Phe Ser Thr Ser Thr Ser Asp Gln Thr Ile Ser Arg Lys Ala Gln
85 90 95

Glu Ala Trp Leu Ala Ile Gln Leu Glu Gln Lys Ala Thr Lys Gln Glu
100 105 110

Ile Leu Thr Tyr Tyr Ile Asn Lys Val Tyr Met Ser Asn Gly Asn Tyr
115 120 125

Gly Met Gln Thr Ala Ala Gln Asn Tyr Tyr Gly Lys Asp Leu Asn Asn
130 135 140

Leu Ser Leu Pro Gln Leu Ala Leu Leu Ala Gly Met Pro Gln Ala Pro
145 150 155 160

Asn Gln Tyr Asp Pro Tyr Ser His Pro Glu Ala Ala Gln Asp Arg Arg
165 170 175

Asn Leu Val Leu Ser Glu Met Lys Asn Gln Gly Tyr Ile Ser Ala Glu
180 185 190

Gln Tyr Glu Lys Ala Val Asn Thr Pro Ile Thr Asp Gly Leu Gln Ser
195 200 205

Leu Lys Ser Ala Ser Asn Tyr Pro Ala Tyr Met Asp Asn Tyr Leu Lys
210 215 220

Glu Val Ile Asn Gln Val Glu Glu Glu Thr Gly Tyr Asn Leu Leu Thr
225 230 235 240

Thr Gly Met Asp Val Tyr Thr Asn Val Asp Gln Glu Ala Gln Lys His
245 250 255

Leu Trp Asp Ile Tyr Asn Thr Asp Glu Tyr Val Ala Tyr Pro Asp Asp
260 265 270

Glu Leu Gln Val Ala Ser Thr Ile Val Asp Val Ser Asn Gly Lys Val
275 280 285

Ile Ala Gln Leu Gly Ala Arg His Gln Ser Ser Asn Val Ser Phe Gly
290 295 300

Ile Asn Gln Ala Val Glu Thr Asn Arg Asp Trp Gly Ser Thr Met Lys
305 310 315 320

Pro Ile Thr Asp Tyr Ala Pro Ala Leu Glu Tyr Gly Val Tyr Asp Ser
325 330 335

Thr Ala Thr Ile Val His Asp Glu Pro Tyr Asn Tyr Pro Gly Thr Asn
340 345 350

Thr Pro Val Tyr Asn Trp Asp Arg Gly Tyr Phe Gly Asn Ile Thr Leu
355 360 365

Gln Tyr Ala Leu Gln Gln Ser Arg Asn Val Pro Ala Val Glu Thr Leu
370 375 380

Asn Lys Val Gly Leu Asn Arg Ala Lys Thr Phe Leu Asn Gly Leu Gly
385 390 395 400

Ile Asp Tyr Pro Ser Ile His Tyr Ser Asn Ala Ile Ser Ser Asn Thr
405 410 415

Thr Glu Ser Asp Lys Lys Tyr Gly Ala Ser Ser Glu Lys Met Ala Ala
420 425 430

Ala Tyr Ala Ala Phe Ala Asn Gly Gly Thr Tyr Tyr Lys Pro Met Tyr
435 440 445

Ile His Lys Val Val Phe Ser Asp Gly Ser Glu Lys Glu Phe Ser Asn
450 455 460

Val Gly Thr Arg Ala Met Lys Glu Thr Thr Ala Tyr Met Met Thr Asp
465 470 475 480

Met Met Lys Thr Val Leu Thr Tyr Gly Thr Gly Arg Asn Ala Tyr Leu
485 490 495

Ala Trp Leu Pro Gln Ala Gly Lys Thr Gly Thr Ser Asn Tyr Thr Asp
500 505 510

Glu Glu Ile Glu Asn His Ile Lys Thr Ser Gln Phe Val Ala Pro Asp
515 520 525

Glu Leu Phe Ala Gly Tyr Thr Arg Lys Tyr Ser Met Ala Val Trp Thr
530 535 540

Gly Tyr Ser Asn Arg Leu Thr Pro Leu Val Gly Asn Gly Leu Thr Val
545 550 555 560

Ala Ala Lys Val Tyr Arg Ser Met Met Thr Tyr Leu Ser Glu Gly Ser
565 570 575

Asn Pro Glu Asp Trp Asn Ile Pro Glu Gly Leu Tyr Arg Asn Gly Glu
580 585 590

Phe Val Phe Lys Asn Gly Ala Arg Ser Thr Trp Asn Ser Pro Ala Pro
595 600 605

Gln Gln Pro Pro Ser Thr Glu Ser Ser Ser Ser Ser Ser Asp Ser Ser
610 615 620

Thr Ser Gln Ser Ser Ser Thr Thr Pro Ser Thr Asn Asn Ser Thr Thr
625 630 635 640

Thr Asn Pro Asn Asn Asn Thr Gln Gln Ser Asn Thr Thr Pro Asp Gln
645 650 655

Gln Asn Gln Asn Pro Gln Pro Ala Gln Pro
660 665

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1714 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

AAATTACAAT ACGGACTATG AATTGACCTC TGGAGAAAAA TTACCTCTTC CTAAAGAGAT 60

TTCAGGTTAC ACTTATATTG GATATATCAA AGAGGGAAAA ACGACTTCTG AGTCTGAAGT 120

AAGTAATCAA AAGAGTTCAG TTGCCACTCC TACAAAACAA CAAAAGGTGG ATTATAATGT 180

TACACCGAAT TTTGTAGACC ATCCATCAAC AGTACAAGCT ATTCAGGAAC AAACACCTGT 240

TTCTTCAACT AAGCCGACAG AAGTTCAAGT AGTTGAAAAA CCTTTCTCTA CTGAATTAAT 300

CAATCCAAGA AAAGAAGAGA AACAATCTTC AGATTCTCAA GAACAATTAG CCGAACATAA 360

GAATCTAGAA ACGAAGAAAG AGGAGAAGAT TTCTCCAAAA GAAAAGACTG GGGTAAATAC 420

ATTAAATCCA CAGGATGAAG TTTTATCAGG TCAATTGAAC AAACCTGAAC TCTTATATCG 480

TGAGGAAACT ATGGAGACAA AAATAGATTT TCAAGAAGAA ATTCAAGAAA ATCCTGATTT 540

AGCTGAAGGA ACTGTAAGAG TAAAACAAGA AGGTAAATTA GGTAAGAAAG TTGAAATCGT 600

CAGAATATTC TCTGTAAACA AGGAAGAAGT TTCGCGAGAA ATTGTTTCAA CTTCAACGAC 660

TGCGCCTAGT CCAAGAATAG TCGAAAAAGG TACTAAAAAA ACTCAAGTTA TAAAGGAACA 720

ACCTGAGACT GGTGTAGAAC ATAAGGACGT ACAGTCTGGA GCTATTGTTG AACCCGCAAT 780

TCAGCCTGAG TTGCCCGAAG CTGTAGTAAG TGACAAAGGC GAACCAGAAG TTCAACCTAC 840

ATTACCCGAA GCAGTTGTGA CCGACAAAGG TGAGACTGAG GTTCAACCAG AGTCGCCAGA 900

TACTGTGGTA AGTGATAAAG GTGAACCAGA GCAGGTAGCA CCGCTTCCAG AATATAAGGG 960

TAATATTGAG CAAGTAAAAC CTGAAACTCC GGTTGAGAAG ACCAAAGAAC AAGGTCCAGA 1020

AAAAACTGAA GAAGTTCCAG TAAAACCAAC AGAAGAAACA CCAGTAAATC CAAATGAAGG 1080

TACTACAGAA GGAACCTCAA TTCAAGAAGC AGAAAATCCA GTTCAACCTG CAGAAGAATC 1140

AACAACGAAT TCAGAGAAAG TATCACCAGA TACATCTAGC AAAAATACTG GGGAAGTGTC 1200

CAGTAATCCT AGTGATTCGA CAACCTCAGT TGGAGAATCA AATAAACCAG AACATAATGA 1260

CTCTAAAAAT GAAAATTCAG AAAAAACTGT AGAAGAAGTT CCAGTAAATC CAAATGAAGG 1320

CACAGTAGAA GGTACCTCAA ATCAAGAAAC AGAAAAACCA GTTCAACCTG CAGAAGAAAC 1380

ACAAACAAAC TCTGGGAAAA TAGCTAACGA AAATACTGGA GAAGTATCCA ATAAACCTAG 1440

TGATTCAAAA CCACCAGTTG AAGAATCAAA TCAACCAGAA AAAAACGGAA CTGCAACAAA 1500

ACCAGAAAAT TCAGGTAATA CAACATCAGA GAATGGACAA ACAGAACCAG AACCATCAAA 1560

CGGAAATTCA ACTGAGGATG TTTCAACCGA ATCAAACACA TCCAATTCAA ATGGAAACGA 1620

AGAAATTAAA CAAGAAAATG AACTAGACCC TGATAAAAAG GTAGAAGAAC CAGAGAAAAC 1680

ACTTGAATTA AGAAATGTTT CCGACCTAGA GTTA 1714
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 571 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asn Tyr Asn Thr Asp Tyr Glu Leu Thr Ser Gly Glu Lys Leu Pro Leu
1 5 10 15

Pro Lys Glu Ile Ser Gly Tyr Thr Tyr Ile Gly Tyr Ile Lys Glu Gly
20 25 30

Lys Thr Thr Ser Glu Ser Glu Val Ser Asn Gln Lys Ser Ser Val Ala
35 40 45

Thr Pro Thr Lys Gln Gln Lys Val Asp Tyr Asn Val Thr Pro Asn Phe
50 55 60

Val Asp His Pro Ser Thr Val Gln Ala Ile Gln Glu Gln Thr Pro Val
65 70 75 80

Ser Ser Thr Lys Pro Thr Glu Val Gln Val Val Glu Lys Pro Phe Ser
85 90 95

Thr Glu Leu Ile Asn Pro Arg Lys Glu Glu Lys Gln Ser Ser Asp Ser
100 105 110

Gln Glu Gln Leu Ala Glu His Lys Asn Leu Glu Thr Lys Lys Glu Glu
115 120 125

Lys Ile Ser Pro Lys Glu Lys Thr Gly Val Asn Thr Leu Asn Pro Gln
130 135 140

Asp Glu Val Leu Ser Gly Gln Leu Asn Lys Pro Glu Leu Leu Tyr Arg
145 150 155 160

Glu Glu Thr Met Glu Thr Lys Ile Asp Phe Gln Glu Glu Ile Gln Glu
165 170 175

Asn Pro Asp Leu Ala Glu Gly Thr Val Arg Val Lys Gln Glu Gly Lys
180 185 190

Leu Gly Lys Lys Val Glu Ile Val Arg Ile Phe Ser Val Asn Lys Glu
195 200 205

Glu Val Ser Arg Glu Ile Val Ser Thr Ser Thr Thr Ala Pro Ser Pro
210 215 220

Arg Ile Val Glu Lys Gly Thr Lys Lys Thr Gln Val Ile Lys Glu Gln
225 230 235 240

Pro Glu Thr Gly Val Glu His Lys Asp Val Gln Ser Gly Ala Ile Val
245 250 255

Glu Pro Ala Ile Gln Pro Glu Leu Pro Glu Ala Val Val Ser Asp Lys
260 265 270

Gly Glu Pro Glu Val Gln Pro Thr Leu Pro Glu Ala Val Val Thr Asp
275 280 285

Lys Gly Glu Thr Glu Val Gln Pro Glu Ser Pro Asp Thr Val Val Ser
290 295 300

Asp Lys Gly Glu Pro Glu Gln Val Ala Pro Leu Pro Glu Tyr Lys Gly
305 310 315 320

Asn Ile Glu Gln Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys Glu
325 330 335

Gln Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu Glu
340 345 350

Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Glu Gly Thr Ser Ile Gln
355 360 365

Glu Ala Glu Asn Pro Val Gln Pro Ala Glu Glu Ser Thr Thr Asn Ser
370 375 380

Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val Ser
385 390 395 400

Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asn Lys Pro
405 410 415

Glu His Asn Asp Ser Lys Asn Glu Asn Ser Glu Lys Thr Val Glu Glu
420 425 430

Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn Gln
435 440 445

Glu Thr Glu Lys Pro Val Gln Pro Ala Glu Glu Thr Gln Thr Asn Ser
450 455 460

Gly Lys Ile Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro Ser
465 470 475 480

Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gln Pro Glu Lys Asn Gly
485 490 495

Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Thr Thr Ser Glu Asn Gly
500 505 510

Gln Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val Ser
515 520 525

Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu Ile Lys Gln
530 535 540

Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys Thr
545 550 555 560

Leu Glu Leu Arg Asn Val Ser Asp Leu Glu Leu
565 570

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TGAGAATCAA GCTACACCCA AAGAGACTAG CGCTCAAAAG ACAATCGTCC TTGCTACAGC 60

TGGCGACGTG CCACCATTTG ACTACGAAGA CAAGGGCAAT CTGACAGGCT TTGATATCGA 120

AGTTTTAAAG GCAGTAGATG AAAAACTCAG CGACTACGAG ATTCAATTCC AAAGAACCGC 180

CTGGGAGAGC ATCTTCCCAG GACTTGATTC TGGTCACTAT CAGGCTGCGG CCAATAACTT 240

GAGTTACACA AAAGAGCGTG CTGAAAAATA CCTTTACTCG CTTCCAATTT CCAACAATCC 300

CCTCGTCCTT GTCAGCAACA AGAAAAATCC TTTGACTTCT CTTGACCAGA TCGCTGGTAA 360

AACAACACAA GAGGATACCG GAACTTCTAA CGCTCAATTC ATCAATAACT GGAATCAGAA 420

ACACACTGAT AATCCCGCTA CAATTAATTT TTCTGGTGAG GATATTGGTA AACGAATCCT 480

AGACCTTGCT AACGGAGAGT TTGATTTCCT AGTTTTTGAC AAGGTATCCG TTCAAAAGAT 540

TATCAAGGAC CGTGGTTTAG ACCTCTCAGT CGTTGATTTA CCTTCTGCAG ATAGCCCCAG 600

CAATTATATC ATTTTCTCAA GCGACCAAAA AGAGTTTAAA GAGCAATTTG ATAAAGCGCT 660

CAAAGAACTC TATCAAGACG GAACCCTTGA AAAACTCAGC AATACCTATC TAGGTGGTTC 720

TTACCTCCCA GATCAATCTC AGTTACAA 748



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Glu Asn Gln Ala Thr Pro Lys Glu Thr Ser Ala Gln Lys Thr Ile Val
1 5 10 15

Leu Ala Thr Ala Gly Asp Val Pro Pro Phe Asp Tyr Glu Asp Lys Gly
20 25 30

Asn Leu Thr Gly Phe Asp Ile Glu Val Leu Lys Ala Val Asp Glu Lys
35 40 45

Leu Ser Asp Tyr Glu Ile Gln Phe Gln Arg Thr Ala Trp Glu Ser Ile
50 55 60

Phe Pro Gly Leu Asp Ser Gly His Tyr Gln Ala Ala Ala Asn Asn Leu
65 70 75 80

Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro Ile
85 90 95

Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu Thr
100 105 110

Ser Leu Asp Gln Ile Ala Gly Lys Thr Thr Gln Glu Asp Thr Gly Thr
115 120 125

Ser Asn Ala Gln Phe Ile Asn Asn Trp Asn Gln Lys His Thr Asp Asn
130 135 140

Pro Ala Thr Ile Asn Phe Ser Gly Glu Asp Ile Gly Lys Arg Ile Leu
145 150 155 160

Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val Ser
165 170 175

Val Gln Lys Ile Ile Lys Asp Arg Gly Leu Asp Leu Ser Val Val Asp
180 185 190

Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr Ile Ile Phe Ser Ser Asp
195 200 205

Gln Lys Glu Phe Lys Glu Gln Phe Asp Lys Ala Leu Lys Glu Leu Tyr
210 215 220

Gln Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly Ser
225 230 235 240

Tyr Leu Pro Asp Gln Ser Gln Leu Gln
245

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 985 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

TGGTAACCGC TCTTCTCGTA ACGCAGCTTC ATCTTCTGAT GTGAAGACAA AAGCAGCAAT 60

CGTCACTGAT ACTGGTGGTG TTGATGACAA ATCATTCAAC CAATCAGCTT GGGAAGGTTT 120

GCAGGCTTGG GGTAAAGAAC ACAATCTTTC AAAAGATAAC GGTTTCACTT ACTTCCAATC 180

AACAAGTGAA GCTGACTACG CTAACAACTT GCAACAAGCG GCTGGAAGTT ACAACCTAAT 240

CTTCGGTGTT GGTTTTGCCC TTAATAATGC AGTTAAAGAT GCAGCAAAAG AACACACTGA 300

CTTGAACTAT GTCTTGATTG ATGATGTGAT TAAAGACCAA AAGAATGTTG CGAGCGTAAC 360

TTTCGCTGAT AATGAGTCAG GTTACCTTGC AGGTGTGGCT GCAGCAAAAA CAACTAAGAC 420

AAAACAAGTT GGTTTTGTAG GTGGTATCGA ATCTGAAGTT ATCTCTCGTT TTGAAGCAGG 480

ATTCAAGGCT GGTGTTGCGT CAGTAGACCC ATCTATCAAA GTCCAAGTTG ACTACGCTGG 540

TTCATTTGGT GATGCGGCTA AAGGTAAAAC AATTGCAGCC GCACAATACG CAGCCGGTGC 600

AGATATTGTT TACCAAGTAG CTGGTGGTAC AGGTGCAGGT GTCTTTGCAG AGGCAAAATC 660

TCTCAACGAA AGCCGTCCTG AAAATGAAAA AGTTTGGGTT ATCGGTGTTG ATCGTGACCA 720

AGAAGCAGAA GGTAAATACA CTTCTAAAGA TGGCAAAGAA TCAAACTTTG TTCTTGTATC 780

TACTTTGAAA CAAGTTGGTA CAACTGTAAA AGATATTTCT AACAAGGCAG AAAGAGGAGA 840

ATTCCCTGGC GGTCAAGTGA TCGTTTACTC ATTGAAGGAT AAAGGGGTTG ACTTGGCAGT 900

AACAAACCTT TCAGAAGAAG GTAAAAAAGC TGTCGAAGAT GCAAAAGCTA AAATCCTTGA 960

TGGAAGCGTA AAAGTTCCTG AAAAA 985

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 328 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser Ser Ser Asp Val Lys Thr
1 5 10 15

Lys Ala Ala Ile Val Thr Asp Thr Gly Gly Val Asp Asp Lys Ser Phe
20 25 30

Asn Gln Ser Ala Trp Glu Gly Leu Gln Ala Trp Gly Lys Glu His Asn
35 40 45

Leu Ser Lys Asp Asn Gly Phe Thr Tyr Phe Gln Ser Thr Ser Glu Ala
50 55 60

Asp Tyr Ala Asn Asn Leu Gln Gln Ala Ala Gly Ser Tyr Asn Leu Ile
65 70 75 80

Phe Gly Val Gly Phe Ala Leu Asn Asn Ala Val Lys Asp Ala Ala Lys
85 90 95

Glu His Thr Asp Leu Asn Tyr Val Leu Ile Asp Asp Val Ile Lys Asp
100 105 110

Gln Lys Asn Val Ala Ser Val Thr Phe Ala Asp Asn Glu Ser Gly Tyr
115 120 125

Leu Ala Gly Val Ala Ala Ala Lys Thr Thr Lys Thr Lys Gln Val Gly
130 135 140

Phe Val Gly Gly Ile Glu Ser Glu Val Ile Ser Arg Phe Glu Ala Gly
145 150 155 160

Phe Lys Ala Gly Val Ala Ser Val Asp Pro Ser Ile Lys Val Gln Val
165 170 175

Asp Tyr Ala Gly Ser Phe Gly Asp Ala Ala Lys Gly Lys Thr Ile Ala
180 185 190

Ala Ala Gln Tyr Ala Ala Gly Ala Asp Ile Val Tyr Gln Val Ala Gly
195 200 205

Gly Thr Gly Ala Gly Val Phe Ala Glu Ala Lys Ser Leu Asn Glu Ser
210 215 220

Arg Pro Glu Asn Glu Lys Val Trp Val Ile Gly Val Asp Arg Asp Gln
225 230 235 240

Glu Ala Glu Gly Lys Tyr Thr Ser Lys Asp Gly Lys Glu Ser Asn Phe
245 250 255

Val Leu Val Ser Thr Leu Lys Gln Val Gly Thr Thr Val Lys Asp Ile
260 265 270

Ser Asn Lys Ala Glu Arg Gly Glu Phe Pro Gly Gly Gln Val Ile Val
275 280 285

Tyr Ser Leu Lys Asp Lys Gly Val Asp Leu Ala Val Thr Asn Leu Ser
290 295 300

Glu Glu Gly Lys Lys Ala Val Glu Asp Ala Lys Ala Lys Ile Leu Asp
305 310 315 320

Gly Ser Val Lys Val Pro Glu Lys
325

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1404 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGCTGATT CAGGTGACAA ACCTGTTATC 60

AAAATGTACC AAATCGGTGA CAAACCAGAC AACTTGGATG AATTGTTAGC AAATGCCAAC 120

AAAATCATTG AAGAAAAAGT TGGTGCCAAA TTGGATATCC AATACCTTGG CTGGGGTGAC 180

TATGGTAAGA AAATGTCAGT TATCACATCA TCTGGTGAAA ACTATGATAT TGCCTTTGCA 240

GATAACTATA TTGTAAATGC TCAAAAAGGT GCTTACGCTG ACTTGACAGA ATTGTACAAA 300

AAAGAAGGTA AAGACCTTTA CAAAGCACTT GACCCAGCTT ACATCAAGGG TAATACTGTA 360

AATGGTAAGA TTTACGCTGT TCCAGTTGCA GCCAACGTTG CATCATCTCA AAACTTTGCC 420

TTCAACGGAA CTCTCCTTGC TAAATATGGT ATCGATATTT CAGGTGTTAC TTCTTACGAA 480

ACTCTTGAGC CAGTCTTGAA ACAAATCAAA GAAAAAGCTC CAGACGTAGT ACCATTTGCT 540

ATTGGTAAAG TTTTCATCCC ATCTGATAAT TTTGACTACC CAGTAGCAAA CGGTCTTCCA 600

TTCGTTATCG ACCTTGAAGG CGATACTACT AAAGTTGTAA ACCGTTACGA AGTGCCTCGT 660

TTCAAAGAAC ACTTGAAGAC TCTTCACAAA TTCTATGAAG CTGGCTACAT TCCAAAAGAC 720

GTCGCAACAA GCGATACTTC CTTTGACCTT CAACAAGATA CTTGGTTCGT TCGTGAAGAA 780

ACAGTAGGAC CAGCTGACTA CGGTAACAGC TTGCTTTCAC GTGTTGCCAA CAAAGATATC 840

CAAATCAAAC CAATTACTAA CTTCATCAAG NAAAACCAAA CAACACAAGT TGCTAACTTT 900

GTCATCTCAA ACAACTCTAA GAACAAAGAA AAATCAATGG AAATCTTGAA CCTCTTGAAT 960

ACGAACCCAG AACTCTTGAA CGGTCTTGTT TACGGTCCAG AAGGCAAGAA CTGGGAAAAA 1020

ATTGAAGGTA AAGAAAACCG TGTTCGCGTT CTTGATGGCT ACAAAGGAAA CACTCACATG 1080

GGTGGATGGA ACACTGGTAA CAACTGGATC CTTTACATCA ACGAAAACGT TACAGACCAA 1140

CAAATCGAAA ATTCTAAGAA AGAATTGGCA GAAGCTAAAG AATCTCCAGC GCTTGGATTT 1200

ATCTTCAATA CTGACAATGT GAAATCTGAA ATCTCAGCTA TTGCTAACAC AATGCAACAA 1260

TTTGATACAG CTATCAACAC TGGTACTGTA GACCCAGATA AAGCGATTCC AGAATTGATG 1320

GAAAAATTGA AATCTGAAGG TGCCTACGAA AAAGTATTGA ACGAAATGCA AAAACAATAC 1380

GATGAATTCT TGAAAAACAA AAAA 1404
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Cys Gly Asn Leu Thr Gly Asn Ser Lys Lys Ala Ala Asp Ser Gly Asp
1 5 10 15

Lys Pro Val Ile Lys Met Tyr Gln Ile Gly Asp Lys Pro Asp Asn Leu
20 25 30

Asp Glu Leu Leu Ala Asn Ala Asn Lys Ile Ile Glu Glu Lys Val Gly
35 40 45

Ala Lys Leu Asp Ile Gln Tyr Leu Gly Trp Gly Asp Tyr Gly Lys Lys
50 55 60

Met Ser Val Ile Thr Ser Ser Gly Glu Asn Tyr Asp Ile Ala Phe Ala
65 70 75 80

Asp Asn Tyr Ile Val Asn Ala Gln Lys Gly Ala Tyr Ala Asp Leu Thr
85 90 95

Glu Leu Tyr Lys Lys Glu Gly Lys Asp Leu Tyr Lys Ala Leu Asp Pro
100 105 110

Ala Tyr Ile Lys Gly Asn Thr Val Asn Gly Lys Ile Tyr Ala Val Pro
115 120 125

Val Ala Ala Asn Val Ala Ser Ser Gln Asn Phe Ala Phe Asn Gly Thr
130 135 140

Leu Leu Ala Lys Tyr Gly Ile Asp Ile Ser Gly Val Thr Ser Tyr Glu
145 150 155 160

Thr Leu Glu Pro Val Leu Lys Gln Ile Lys Glu Lys Ala Pro Asp Val
165 170 175

Val Pro Phe Ala Ile Gly Lys Val Phe Ile Pro Ser Asp Asn Phe Asp
180 185 190

Tyr Pro Val Ala Asn Gly Leu Pro Phe Val Ile Asp Leu Glu Gly Asp
195 200 205

Thr Thr Lys Val Val Asn Arg Tyr Glu Val Pro Arg Phe Lys Glu His
210 215 220

Leu Lys Thr Leu His Lys Phe Tyr Glu Ala Gly Tyr Ile Pro Lys Asp
225 230 235 240

Val Ala Thr Ser Asp Thr Ser Phe Asp Leu Gln Gln Asp Thr Trp Phe
245 250 255

Val Arg Glu Glu Thr Val Gly Pro Ala Asp Tyr Gly Asn Ser Leu Leu
260 265 270

Ser Arg Val Ala Asn Lys Asp Ile Gln Ile Lys Pro Ile Thr Asn Phe
275 280 285

Ile Lys Xaa Asn Gln Thr Thr Gln Val Ala Asn Phe Val Ile Ser Asn
290 295 300

Asn Ser Lys Asn Lys Glu Lys Ser Met Glu Ile Leu Asn Leu Leu Asn
305 310 315 320

Thr Asn Pro Glu Leu Leu Asn Gly Leu Val Tyr Gly Pro Glu Gly Lys
325 330 335

Asn Trp Glu Lys Ile Glu Gly Lys Glu Asn Arg Val Arg Val Leu Asp
340 345 350

Gly Tyr Lys Gly Asn Thr His Met Gly Gly Trp Asn Thr Gly Asn Asn
355 360 365

Trp Ile Leu Tyr Ile Asn Glu Asn Val Thr Asp Gln Gln Ile Glu Asn
370 375 380

Ser Lys Lys Glu Leu Ala Glu Ala Lys Glu Ser Pro Ala Leu Gly Phe
385 390 395 400

Ile Phe Asn Thr Asp Asn Val Lys Ser Glu Ile Ser Ala Ile Ala Asn
405 410 415

Thr Met Gln Gln Phe Asp Thr Ala Ile Asn Thr Gly Thr Val Asp Pro
420 425 430

Asp Lys Ala Ile Pro Glu Leu Met Glu Lys Leu Lys Ser Glu Gly Ala
435 440 445

Tyr Glu Lys Val Leu Asn Glu Met Gln Lys Gln Tyr Asp Glu Phe Leu
450 455 460

Lys Asn Lys Lys
465

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 937 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TGGTCAAGGA ACTGCTTCTA AAGACAACAA AGAGGCAGAA CTTAAGAAGG TTGACTTTAT 60

CCTAGACTGG ACACCAAATA CCAACCACAC AGGGCTTTAT GTTGCCAAGG AAAAAGGTTA 120

TTTCAAAGAA GCTGGAGTGG ATGTTGATTT GAAATTGCCA CCAGAAGAAA GTTCTTCTGA 180

CTTGGTTATC AACGGAAAGG CACCATTTGC AGTGTATTTC CAAGACTACA TGGCTAAGAA 240

ATTGGAAAAA GGAGCAGGAA TCACTGCCGT TGCAGCTATT GTTGAACACA ATACATCAGG 300

AATCATCTCT CGTAAATCTG ATAATGTAAG CAGTCCAAAA GACTTGGTTG GTAAGAAATA 360

TGGGACATGG AATGACCCAA CTGAACTTGC TATGTTGAAA ACCTTGGTAG AATCTCAAGG 420

TGGAGACTTT GAGAAGGTTG AAAAAGTACC AAATAACGAC TCAAACTCAA TCACACCGAT 480

TGCCAATGGC GTCTTTGATA CTGCTTGGAT TTACTACGGT TGGGATGGTA TCCTTGCTAA 540

ATCTCAAGGT GTAGATGCTA ACTTCATGTA CTTGAAAGAC TATGTCAAGG AGTTTGACTA 600

CTATTCACCA GTTATCATCG CAAACAACGA CTATCTGAAA GATAACAAAG AAGAAGCTCG 660

CAAAGTCATC CAAGCCATCA AAAAAGGCTA CCAATATGCC ATGGAACATC CAGAAGAAGC 720

TGCAGATATT CTCATCAAGA ATGCACCTGA ACTCAAGGAA AAACGTGACT TTGTCATCGA 780

ATCTCAAAAA TACTTGTCAA AAGAATACGC AAGCGACAAG GAAAAATGGG GTCAATTTGA 840

CGCAGCTCGC TGGAATGCTT TCTACAAATG GGATAAAGAA AATGGTATCC TTAAAGAAGA 900

CTTGACAGAC AAAGGCTTCA CCAACGAATT TGTGAAA 937



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Gly Gln Gly Thr Ala Ser Lys Asp Asn Lys Glu Ala Glu Leu Lys Lys
1 5 10 15

Val Asp Phe Ile Leu Asp Trp Thr Pro Asn Thr Asn His Thr Gly Leu
20 25 30

Tyr Val Ala Lys Glu Lys Gly Tyr Phe Lys Glu Ala Gly Val Asp Val
35 40 45

Asp Leu Lys Leu Pro Pro Glu Glu Ser Ser Ser Asp Leu Val Ile Asn
50 55 60

Gly Lys Ala Pro Phe Ala Val Tyr Phe Gln Asp Tyr Met Ala Lys Lys
65 70 75 80

Leu Glu Lys Gly Ala Gly Ile Thr Ala Val Ala Ala Ile Val Glu His
85 90 95

Asn Thr Ser Gly Ile Ile Ser Arg Lys Ser Asp Asn Val Ser Ser Pro
100 105 110

Lys Asp Leu Val Gly Lys Lys Tyr Gly Thr Trp Asn Asp Pro Thr Glu
115 120 125

Leu Ala Met Leu Lys Thr Leu Val Glu Ser Gln Gly Gly Asp Phe Glu
130 135 140

Lys Val Glu Lys Val Pro Asn Asn Asp Ser Asn Ser Ile Thr Pro Ile
145 150 155 160

Ala Asn Gly Val Phe Asp Thr Ala Trp Ile Tyr Tyr Gly Trp Asp Gly
165 170 175

Ile Leu Ala Lys Ser Gln Gly Val Asp Ala Asn Phe Met Tyr Leu Lys
180 185 190

Asp Tyr Val Lys Glu Phe Asp Tyr Tyr Ser Pro Val Ile Ile Ala Asn
195 200 205

Asn Asp Tyr Leu Lys Asp Asn Lys Glu Glu Ala Arg Lys Val Ile Gln
210 215 220

Ala Ile Lys Lys Gly Tyr Gln Tyr Ala Met Glu His Pro Glu Glu Ala
225 230 235 240

Ala Asp Ile Leu Ile Lys Asn Ala Pro Glu Leu Lys Glu Lys Arg Asp
245 250 255

Phe Val Ile Glu Ser Gln Lys Tyr Leu Ser Lys Glu Tyr Ala Ser Asp
260 265 270

Lys Glu Lys Trp Gly Gln Phe Asp Ala Ala Arg Trp Asn Ala Phe Tyr
275 280 285

Lys Trp Asp Lys Glu Asn Gly Ile Leu Lys Glu Asp Leu Thr Asp Lys
290 295 300

Gly Phe Thr Asn Glu Phe Val Lys
305 310

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TAGCTCAGGT GGAAACGCTG GTTCATCCTC TGGAAAAACA ACTGCCAAAG CTCGCACTAT 60

CGATGAAATC AAAAAAAGCG GTGAACTGCG AATCGCCGTG TTTGGAGATA AAAAACCGTT 120

TGGCTACGTT GACAATGATG GTTCTACCAA GGTACGCTAC GATATTGAAC TAGGGAACCA 180

ACTAGCTCAA GACCTTGGTG TCAAGGTTAA ATACATTTCA GTCGATGCTG CCAACCGTGC 240

GGAATACTTG ATTTCAAACA AGGTAGATAT TACTCTTGCT AACTTTACAG TAACTGACGA 300

ACGTAAGAAA CAAGTTGATT TTGCCCTTCC ATATATGAAA GTTTCTCTGG GTGTCGTATC 360

ACCTAAGACT GGTCTCATTA CAGACGTCAA ACAACTTGAA GGTAAAACCT TAATTGTCAC 420

AAAAGGAACG ACTGCTGAGA CTTATTTTGA AAAGAATCAT CCAGAAATCA AACTCCAAAA 480

ATACGACCAA TACAGTGACT CTTACCAAGC TCTTCTTGAC GGACGTGGAG ATGCCTTTTC 540

AACTGACAAT ACGGAAGTTC TAGCTTGGGC GCTTGAAAAT AAAGGATTTG AAGTAGGAAT 600

TACTTCCCTC GGTGATCCCG ATACCATTGC GGCAGCAGTT CAAAAAGGCA ACCAAGAATT 660

GCTAGACTTC ATCAATAAAG ATATTGAAAA ATTAGGCAAG GAAAACTTCT TCCACAAGGC 720

CTATGAAAAG ACACTTCACC CAACCTACGG TGACGCTGCT AAAGCAGATG ACCTGGTTGT 780

TGAAGGTGGA AAAGTTGAT 799

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 266 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Ser Ser Gly Gly Asn Ala Gly Ser Ser Ser Gly Lys Thr Thr Ala Lys
1 5 10 15

Ala Arg Thr Ile Asp Glu Ile Lys Lys Ser Gly Glu Leu Arg Ile Ala
20 25 30

Val Phe Gly Asp Lys Lys Pro Phe Gly Tyr Val Asp Asn Asp Gly Ser
35 40 45

Thr Lys Val Arg Tyr Asp Ile Glu Leu Gly Asn Gln Leu Ala Gln Asp
50 55 60

Leu Gly Val Lys Val Lys Tyr Ile Ser Val Asp Ala Ala Asn Arg Ala
65 70 75 80

Glu Tyr Leu Ile Ser Asn Lys Val Asp Ile Thr Leu Ala Asn Phe Thr
85 90 95

Val Thr Asp Glu Arg Lys Lys Gln Val Asp Phe Ala Leu Pro Tyr Met
100 105 110

Lys Val Ser Leu Gly Val Val Ser Pro Lys Thr Gly Leu Ile Thr Asp
115 120 125

Val Lys Gln Leu Glu Gly Lys Thr Leu Ile Val Thr Lys Gly Thr Thr
130 135 140

Ala Glu Thr Tyr Phe Glu Lys Asn His Pro Glu Ile Lys Leu Gln Lys
145 150 155 160

Tyr Asp Gln Tyr Ser Asp Ser Tyr Gln Ala Leu Leu Asp Gly Arg Gly
165 170 175

Asp Ala Phe Ser Thr Asp Asn Thr Glu Val Leu Ala Trp Ala Leu Glu
180 185 190

Asn Lys Gly Phe Glu Val Gly Ile Thr Ser Leu Gly Asp Pro Asp Thr
195 200 205

Ile Ala Ala Ala Val Gln Lys Gly Asn Gln Glu Leu Leu Asp Phe Ile
210 215 220

Asn Lys Asp Ile Glu Lys Leu Gly Lys Glu Asn Phe Phe His Lys Ala
225 230 235 240

Tyr Glu Lys Thr Leu His Pro Thr Tyr Gly Asp Ala Ala Lys Ala Asp
245 250 255

Asp Leu Val Val Glu Gly Gly Lys Val Asp
260 265

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1189 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CTCCAACTAT GGTAAATCTG CGGATGGCAC AGTGACCATC GAGTATTTCA ACCAGAAAAA 60

AGAAATGACC AAAACCTTGG AAGAAATCAC TCGTGATTTT GAGAAGGAAA ACCCTAAGAT 120

CAAGGTCAAA GTCGTCAATG TACCAAATGC TGGTGAAGTA TTGAAGACAC GCGTTCTCGC 180

AGGAGATGTG CCTGATGTGG TCAATATTTA CCCACAGTCC ATCGAACTGC AAGAATGGGC 240

AAAAGCAGGT GTTTTTGAAG ATTTGAGCAA CAAAGACTAC CTGAAACGCG TGAAAAATGG 300

CTACGCTGAA AAATATGCTG TAAACGAAAA AGTTTACAAC GTTCCTTTTA CAGCTAATGC 360

TTATGGAATT TACTACAACA AAGATAAATT CGAAGAACTG GGCTTGAAGG TTCCTGAAAC 420

CTGGGATGAA TTTGAACAGT TAGTCAAAGA TATCGTTGCT AAAGGACAAA CACCATTTGG 480

AATTGCAGGT GCAGATGCTT GGACACTCAA TGGTTACAAT CAATTAGCCT TTGCGACAGC 540

AACAGGTGGA GGAAAAGAAG CAAATCAATA CCTTCGTTAT TCTCAACCAA ATGCCATTAA 600

ATTGTCGGAT CCGATTATGA AAGATGATAT CAAGGTCATG GACATCCTTC GCATCAATGG 660

ATCTAAGCAA AAGAACTGGG AAGGTGCTGG CTATACCGAT GTTATCGGAG CCTTCGCACG 720

TGGGGATGTC CTCATGACAC CAAATGGGTC TTGGGCGATC ACAGCGATTA ATGAACAAAA 780

ACCGAACTTT AAGATTGGGA CCTTCATGAT TCCAGGAAAA GAAAAAGGAC AAAGCTTAAC 840

CGTTGGTGCG GGAGACTTGG CATGGTCTAT CTCAGCCACC ACCAAACATC CAAAAGAAGC 900

CAATGCCTTT GTGGAATATA TGACCCGTCC AGAAGTCATG CAAAAATACT ACGATGTGGA 960

CGGATCTCCA ACAGCGATCG AAGGGGTCAA ACAAGCAGGA GAAGATTCAC CGCTTGCTGG 1020

TATGACCGAA TATGCCTTTA CGGATCGTCA CTTGGTCTGG TTGCAACAAT ACTGGACCAG 1080

TGAAGCAGAC TTCCATACCT TGACCATGAA CTATGTCTTG ACCGGTGATA AACAAGGCAT 1140

GGTCAATGAT TTGAATGCCT TCTTTAACCC GATGAAAGCG GATGTGGAT 1189



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Ser Asn Tyr Gly Lys Ser Ala Asp Gly Thr Val Thr Ile Glu Tyr Phe 
1 5 10 15 

Asn Gln Lys Lys Glu Met Thr Lys Thr Leu Glu Glu Ile Thr Arg Asp 
20 25 30 

Phe Glu Lys Glu Asn Pro Lys Ile Lys Val Lys Val Val Asn Val Pro 
35 40 45 

Asn Ala Gly Glu Val Leu Lys Thr Arg Val Leu Ala Gly Asp Val Pro 
50 55 60 

Asp Val Val Asn Ile Tyr Pro Gln Ser Ile Glu Leu Gln Glu Trp Ala 
65 70 75 80 

Lys Ala Gly Val Phe Glu Asp Leu Ser Asn Lys Asp Tyr Leu Lys Arg 
85 90 95 

Val Lys Asn Gly Tyr Ala Glu Lys Tyr Ala Val Asn Glu Lys Val Tyr 
100 105 110 

Asn Val Pro Phe Thr Ala Asn Ala Tyr Gly Ile Tyr Tyr Asn Lys Asp 
115 120 125 

Lys Phe Glu Glu Leu Gly Leu Lys Val Pro Glu Thr Trp Asp Glu Phe 
130 135 140 

Glu Gln Leu Val Lys Asp Ile Val Ala Lys Gly Gln Thr Pro Phe Gly 
145 150 155 160 

Ile Ala Gly Ala Asp Ala Trp Thr Leu Asn Gly Tyr Asn Gln Leu Ala 
165 170 175 

Phe Ala Thr Ala Thr Gly Gly Gly Lys Glu Ala Asn Gln Tyr Leu Arg 
180 185 190 

Tyr Ser Gln Pro Asn Ala Ile Lys Leu Ser Asp Pro Ile Met Lys Asp 
195 200 205 

Asp Ile Lys Val Met Asp Ile Leu Arg Ile Asn Gly Ser Lys Gln Lys 
210 215 220 

Asn Trp Glu Gly Ala Gly Tyr Thr Asp Val Ile Gly Ala Phe Ala Arg 
225 230 235 240 

Gly Asp Val Leu Met Thr Pro Asn Gly Ser Trp Ala Ile Thr Ala Ile 
245 250 255 

Asn Glu Gln Lys Pro Asn Phe Lys Ile Gly Thr Phe Met Ile Pro Gly 
260 265 270 

Lys Glu Lys Gly Gln Ser Leu Thr Val Gly Ala Gly Asp Leu Ala Trp 
275 280 285 

Ser Ile Ser Ala Thr Thr Lys His Pro Lys Glu Ala Asn Ala Phe Val 
290 295 300 

Glu Tyr Met Thr Arg Pro Glu Val Met Gln Lys Tyr Tyr Asp Val Asp 
305 310 315 320 

Gly Ser Pro Thr Ala Ile Glu Gly Val Lys Gln Ala Gly Glu Asp Ser 
325 330 335 

Pro Leu Ala Gly Met Thr Glu Tyr Ala Phe Thr Asp Arg His Leu Val 
340 345 350 

Trp Leu Gln Gln Tyr Trp Thr Ser Glu Ala Asp Phe His Thr Leu Thr 
355 360 365 

Met Asn Tyr Val Leu Thr Gly Asp Lys Gln Gly Met Val Asn Asp Leu 
370 375 380 

Asn Ala Phe Phe Asn Pro Met Lys Ala Asp Val Asp 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGGGAAAAAT TCTAGCGAAA CTAGTGGAGA TAATTGGTCA AAGTACCAGT CTAACAAGTC 60 

TATTACTATT GGATTTGATA GTACTTTTGT TCCAATGGGA TTTGCTCAGA AAGATGGTTC 120 

TTATGCAGGA TTTGATATTG ATTTAGCTAC AGCTGTTTTT GAAAAATACG GAATCACGGT 180 

AAATTGGCAA CCGATTGATT GGGATTTGAA AGAAGCTGAA TTGACAAAAG GAACGATTGA 240 

TCTGATTTGG AATGGCTATT CCGCTACAGA CGAACGCCGT GAAAAGGTGG CTTTCAGTAA 300 

CTCATATATG AAGAATGAGC AGGTATTGGT TACGAAGAAA TCATCTGGTA TCACGACTGC 360 

AAAGGATATG ACTGGAAAGA CATTAGGAGC TCAAGCTGGT TCATCTGGTT ATGCGGACTT 420 

TGAAGCAAAT CCAGAAATTT TGAAGAATAT TGTCGCTAAT AAGGAAGCGA ATCAATACCA 480 

AACCTTTAAT GAAGCCTTGA TTGATTTGAA AAACGATCGA ATTGATGGTC TATTGATTGA 540 

CCGTGTCTAT GCAAACTATT ATTTAGAAGC AGAAGGTGTT TTAAACGATT ATAATGTCTT 600 

TACAGTTGGA CTAGAAACAG AAGCTTTTGC GGTTGGAGCC CGTAAGGAAG ATACAAACTT 660 

GGTTAAGAAG ATAAATGAAG CTTTTTCTAG TCTTTACAAG GACGGCAAGT TCCAAGAAAT 720 

CAGCCAAAAA TGGTTTGGAG AAGATGTAGC AACCAAAGAA GTAAAAGAAG GACAG 775 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Gly Lys Asn Ser Ser Glu Thr Ser Gly Asp Asn Trp Ser Lys Tyr Gln 
1 5 10 15 

Ser Asn Lys Ser Ile Thr Ile Gly Phe Asp Ser Thr Phe Val Pro Met 
20 25 30 

Gly Phe Ala Gln Lys Asp Gly Ser Tyr Ala Gly Phe Asp Ile Asp Leu 
35 40 45 

Ala Thr Ala Val Phe Glu Lys Tyr Gly Ile Thr Val Asn Trp Gln Pro 
50 55 60 

Ile Asn Trp Asp Leu Lys Glu Ala Glu Leu Thr Lys Gly Thr Ile Asp 
65 70 75 80 

Leu Ile Trp Asn Gly Tyr Ser Ala Thr Asp Glu Arg Arg Glu Lys Val 
85 90 95 

Ala Phe Ser Asn Ser Tyr Met Lys Asn Glu Gln Val Leu Val Thr Lys 
100 105 110 

Lys Ser Ser Gly Ile Thr Thr Ala Lys Asp Met Thr Gly Lys Thr Leu 
115 120 125 

Gly Ala Gln Ala Gly Ser Ser Gly Tyr Ala Asp Phe Glu Ala Asn Pro 
130 135 140 

Glu Ile Leu Lys Asn Ile Val Ala Asn Lys Glu Ala Asn Gln Tyr Gln 
145 150 155 160 

Thr Phe Asn Glu Ala Leu Ile Asp Leu Lys Asn Asp Arg Ile Asp Gly 
165 170 175 

Leu Leu Ile Asp Arg Val Tyr Ala Asn Tyr Tyr Leu Glu Ala Glu Gly 
180 185 190 

Val Leu Asn Asp Tyr Asn Val Phe Thr Val Gly Leu Glu Thr Glu Ala 
195 200 205 

Phe Ala Val Gly Ala Arg Lys Glu Asp Thr Asn Leu Val Lys Lys Ile 
210 215 220 

Asn Glu Ala Phe Ser Ser Leu Tyr Lys Asp Gly Lys Phe Gln Glu Ile 
225 230 235 240 

Ser Gln Lys Trp Phe Gly Glu Asp Val Ala Thr Lys Glu Val Lys Glu 
245 250 255 

Gly Gln 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TGCTAGCGGA AAAAAAGATA CAACTTCTGG TCAAAAACTA AAAGTTGTTG CTACAAACTC 60 

AATCATCGCT GATATTACTA AAAATATTGC TGGTGACAAA ATTGACCTTC ATAGTATCGT 120 

TCCGATTGGG CAAGACCCAC ACGAATACGA ACCACTTCCT GAAGACGTTA AGAAAACTTC 180 

TGAGGCTAAT TTGATTTTCT ATAACGGTAT CAACCTTGAA ACAGGTGGCA ATGCTTGGTT 240 

TACAAAATTG GTAGAAAATG CCAAGAAAAC TGAAAACAAA GACTACTTCG CAGTCAGCGA 300 

CGGCGTTGAT GTTATCTACC TTGAAGGTCA AAATGAAAAA GGAAAAGAAG ACCCACACGC 360 

TTGGCTTAAC CTTGAAAACG GTATTATTTT TGCTAAAAAT ATCGCCAAAC AATTGAGCGC 420 

CAAAGACCCT AACAATAAAG AATTCTATGA AAAAAATCTC AAAGAATATA CTGATAAGTT 480 

AGACAAACTT GATAAAGAAA GTAAGGATAA ATTTAATAAG ATCCCTGCTG AAAAGAAACT 540 

CATTGTAACC AGCGAAGGAG CATTCAAATA CTTCTCTAAA GCCTATGGTG TCCCAAGTGC 600 

TTACATCTGG GAAATCAATA CTGAAGAAGA AGGAACTCCT GAACAAATCA AGACCTTGGT 660 

TGAAAAACTT CGCCAAACAA AAGTTCCATC ACTCTTTGTA GAATCAAGTG TGGATGACCG 720 

TCCAATGAAA ACTGTTTCTC AAGACACAAA CATCCCAATC TACGCTCAAA TCTTTACTGA 780 

CTCTATCGCA GAACAAGGTA AAGAAGGCGA CAGCTACTAC AGCATGATGA AATACAACCT 840 

TGACAAGATT GCTGAAGGAT TGGCAAAA 868 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ala Ser Gly Lys Lys Asp Thr Thr Ser Gly Gln Lys Leu Lys Val Val 
1 5 10 15 

Ala Thr Asn Ser Ile Ile Ala Asp Ile Thr Lys Asn Ile Ala Gly Asp 
20 25 30 

Lys Ile Asp Leu His Ser Ile Val Pro Ile Gly Gln Asp Pro His Glu 
35 40 45 

Tyr Glu Pro Leu Pro Glu Asp Val Lys Lys Thr Ser Glu Ala Asn Leu 
50 55 60 

Ile Phe Tyr Asn Gly Ile Asn Leu Glu Thr Gly Gly Asn Ala Trp Phe 
65 70 75 80 

Thr Lys Leu Val Glu Asn Ala Lys Lys Thr Glu Asn Lys Asp Tyr Phe 
85 90 95 

Ala Val Ser Asp Gly Val Asp Val Ile Tyr Leu Glu Gly Gln Asn Glu 
100 105 110 

Lys Gly Lys Glu Asp Pro His Ala Trp Leu Asn Leu Glu Asn Gly Ile 
115 120 125 

Ile Phe Ala Lys Asn Ile Ala Lys Gln Leu Ser Ala Lys Asp Pro Asn 
130 135 140 

Asn Lys Glu Phe Tyr Glu Lys Asn Leu Lys Glu Tyr Thr Asp Lys Leu 
145 150 155 160 

Asp Lys Leu Asp Lys Glu Ser Lys Asp Lys Phe Asn Lys Ile Pro Ala 
165 170 175 

Glu Lys Lys Leu Ile Val Thr Ser Glu Gly Ala Phe Lys Tyr Phe Ser 
180 185 190 

Lys Ala Tyr Gly Val Pro Ser Ala Tyr Ile Trp Glu Ile Asn Thr Glu 
195 200 205 

Glu Glu Gly Thr Pro Glu Gln Ile Lys Thr Leu Val Glu Lys Leu Arg 
210 215 220 

Gln Thr Lys Val Pro Ser Leu Phe Val Glu Ser Ser Val Asp Asp Arg 
225 230 235 240 

Pro Met Lys Thr Val Ser Gln Asp Thr Asn Ile Pro Ile Tyr Ala Gln 
245 250 255 

Ile Phe Thr Asp Ser Ile Ala Glu Gln Gly Lys Glu Gly Asp Ser Tyr 
260 265 270 

Tyr Ser Met Met Lys Tyr Asn Leu Asp Lys Ile Ala Glu Gly Leu Ala 
275 280 285 

Lys 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

TGGCTCAAAA AATACAGCTT CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT 60 

TCAAGAAAAG AAAACATTGA AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC 120 

AAATGAAAAG TTAATTTTGC AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC 180 

CAACTACCAA TCCGACTTTG CAGAAAAACG TAACTTGGAT ATTTCTAGTG GTGATTTACC 240 

AGATGCTATC CACAACGACG GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG 300 

TGTTATTATT CCAGTTGAAG ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT 360 

GGATGAGAAA CCAGAGTACA AGGCCTTGAT GACAGCACCT GATGGGCACA TTTACTCATT 420 

TCCATGGATT GAAGAGCTTG GAGATGGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC 480 

TTGGATTAAC AAAGATTGGC TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA 540 

TTTGATTAAA GTCCTAGAAG CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA 600 

TGAAATTCCA TTTTCATTTA TTAGTGGTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC 660 

TGCATTTGGT ATAGGGGATA ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAAGTTGA 720 

CTTCACAGCA GATAACGATA ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA 780 

AAAAGGCCTG ATTGATAAAG AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG 840 

TCATGATCAG AAATTTGGTG TTTACTTTAC ATGGGATAAG AATAATGTTA CTGGAAGTAA 900 

CGAAAGTTAT GATGTTTTAC CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG 960 

TACAAACGGT ATGGGATTTG CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT 1020 

AGAATTGACA GCTAAATGGA TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA 1080 

CTGGGGAACT TACGGAGATG ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA 1140 

TAGTCTAAAA CACTTACCAC TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA 1200 

AGTAGGAGGA CCACTAGCTA TCCTAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA 1260 

TGATGCCAAA TGGCGTTTGG ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT 1320 

CAATAACTAT CCAAGAGTCT TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA 1380 

AGCAGATATG AATGACTATA TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT 1440 

TGATACTGAG TGGGATGATT ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT 1500 

CGCTATTAAA CAAAAATACT ACGACCAATA CCAAGCAAAC AAAAAC 1546 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gly Ser Lys Asn Thr Ala Ser Ser Pro Asp Tyr Lys Leu Glu Gly Val 
1 5 10 15 

Thr Phe Pro Leu Gln Glu Lys Lys Thr Leu Lys Phe Met Thr Ala Ser 
20 25 30 

Ser Pro Leu Ser Pro Lys Asp Pro Asn Glu Lys Leu Ile Leu Gln Arg 
35 40 45 

Leu Glu Lys Glu Thr Gly Val His Ile Asp Trp Thr Asn Tyr Gln Ser 
50 55 60 

Asp Phe Ala Glu Lys Arg Asn Leu Asp Ile Ser Ser Gly Asp Leu Pro 
65 70 75 80 

Asp Ala Ile His Asn Asp Gly Ala Ser Asp Val Asp Leu Met Asn Trp 
85 90 95 

Ala Lys Lys Gly Val Ile Ile Pro Val Glu Asp Leu Ile Asp Lys Tyr 
100 105 110 

Met Pro Asn Leu Lys Lys Ile Leu Asp Glu Lys Pro Glu Tyr Lys Ala 
115 120 125 

Leu Met Thr Ala Pro Asp Gly His Ile Tyr Ser Phe Pro Trp Ile Glu 
130 135 140 

Glu Leu Gly Asp Gly Lys Glu Ser Ile His Ser Val Asn Asp Met Ala 
145 150 155 160 

Trp Ile Asn Lys Asp Trp Leu Lys Lys Leu Gly Leu Glu Met Pro Lys 
165 170 175 

Thr Thr Asp Asp Leu Ile Lys Val Leu Glu Ala Phe Lys Asn Gly Asp 
180 185 190 

Pro Asn Gly Asn Gly Glu Ala Asp Glu Ile Pro Phe Ser Phe Ile Ser 
195 200 205 

Gly Asn Gly Asn Glu Asp Phe Lys Phe Leu Phe Ala Ala Phe Gly Ile 
210 215 220 

Gly Asp Asn Asp Asp His Leu Val Val Gly Asn Asp Gly Lys Val Asp 
225 230 235 240 

Phe Thr Ala Asp Asn Asp Asn Tyr Lys Glu Gly Val Lys Phe Ile Arg 
245 250 255 

Gln Leu Gln Glu Lys Gly Leu Ile Asp Lys Glu Ala Phe Glu His Asp 
260 265 270 

Trp Asn Ser Tyr Ile Ala Lys Gly His Asp Gln Lys Phe Gly Val Tyr 
275 280 285 

Phe Thr Trp Asp Lys Asn Asn Val Thr Gly Ser Asn Glu Ser Tyr Asp 
290 295 300 

Val Leu Pro Val Leu Ala Gly Pro Ser Gly Gln Lys His Val Ala Arg 
305 310 315 320 

Thr Asn Gly Met Gly Phe Ala Arg Asp Lys Met Val Ile Thr Ser Val 
325 330 335 

Asn Lys Asn Leu Glu Leu Thr Ala Lys Trp Ile Asp Ala Gln Tyr Ala 
340 345 350 

Pro Leu Gln Ser Val Gln Asn Asn Trp Gly Thr Tyr Gly Asp Asp Lys 
355 360 365 

Gln Gln Asn Ile Phe Glu Leu Asp Gln Ala Ser Asn Ser Leu Lys His 
370 375 380 

Leu Pro Leu Asn Gly Thr Ala Pro Ala Glu Leu Arg Gln Lys Thr Glu 
385 390 395 400 

Val Gly Gly Pro Leu Ala Ile Leu Asp Ser Tyr Tyr Gly Lys Val Thr 
405 410 415 

Thr Met Pro Asp Asp Ala Lys Trp Arg Leu Asp Leu Ile Lys Glu Tyr 
420 425 430 

Tyr Val Pro Tyr Met Ser Asn Val Asn Asn Tyr Pro Arg Val Phe Met 
435 440 445 

Thr Gln Glu Asp Leu Asp Lys Ile Ala His Ile Glu Ala Asp Met Asn 
450 455 460 

Asp Tyr Ile Tyr Arg Lys Arg Ala Glu Trp Ile Val Asn Gly Asn Ile 
465 470 475 480 

Asp Thr Glu Trp Asp Asp Tyr Lys Lys Glu Leu Glu Lys Tyr Gly Leu 
485 490 495 

Ser Asp Tyr Leu Ala Ile Lys Gln Lys Tyr Tyr Asp Gln Tyr Gln Ala 
500 505 510 

Asn Lys Asn 
515 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TAGTACAAAC TCAAGCACTA GTCAGACAGA GACCAGTAGC TCTGCTCCAA CAGAGGTAAC 60 

CATTAAAAGT TCACTGGACG AGGTCAAACT TTCCAAAGTT CCTGAAAAGA TTGTGACCTT 120 

TGACCTCGGC GCTGCGGATA CTATTCGCGC TTTAGGATTT GAAAAAAATA TCGTCGGAAT 180 

GCCTACAAAA ACTGTTCCGA CTTATCTAAA AGACCTAGTG GGAACTGTCA AAAATGTTGG 240 

TTCTATGAAA GAACCTGATT TAGAAGCTAT CGCCGCCCTT GAGCCTGATT TGATTATCGC 300 

TTCGCCACGT ACACAAAAAT TCGTAGACAA ATTCAAAGAA ATCGCCCCAA CCGTTCTCTT 360 

CCAAGCAAGC AAGGACGACT ACTGGACTTC TACCAAGGCT AATATCGAAT CCTTAGCAAG 420 

TGCCTTCGGC GAAACTGGTA CACAGAAAGC CAAGGAAGAA TTGACCAAGC TAGACAAGAG 480 

CATCCAAGAA GTCGCTACTA AAAATGAAAG CTCTGACAAA AAAGCCCTTG CGATCCTCCT 540 

TAATGAAGGA AAAATGGCAG CCTTTGGTGC CAAATCTCGT TTCTCTTTCT TGTACCAAAC 600 

CTTGAAATTC AAACCAACTG ATACAAAATT TGAAGACTCA CGCCACGGAC AAGAAGTCAG 660 

CTTTGAAAGT GTCAAAGAAA TCAACCCTGA CATCCTCTTT GTCATCAACC GTACCCTTGC 720 

CATCGGTGGG GACAACTCTA GCAACGACGG TGTCCTAGAA AATGCCCTTA TCGCTGAAAC 780 

ACCTGCTGCT AAAAATGGTA AGATTATCCA ACTAACACCA GACCTCTGGT ATCTAAGCGG 840 

AGGCGGACTT GAATCAACAA AACTCATGAT TGAAGACATA CAAAAAGCTT TGAAA 895 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ser Thr Asn Ser Ser Thr Ser Gln Thr Glu Thr Ser Ser Ser Ala Pro 
1 5 10 15 

Thr Glu Val Thr Ile Lys Ser Ser Leu Asp Glu Val Lys Leu Ser Lys 
20 25 30 

Val Pro Glu Lys Ile Val Thr Phe Asp Leu Gly Ala Ala Asp Thr Ile 
35 40 45 

Arg Ala Leu Gly Phe Glu Lys Asn Ile Val Gly Met Pro Thr Lys Thr 
50 55 60 

Val Pro Thr Tyr Leu Lys Asp Leu Val Gly Thr Val Lys Asn Val Gly 
65 70 75 80 

Ser Met Lys Glu Pro Asp Leu Glu Ala Ile Ala Ala Leu Glu Pro Asp 
85 90 95 

Leu Ile Ile Ala Ser Pro Arg Thr Gln Lys Phe Val Asp Lys Phe Lys 
100 105 110 

Glu Ile Ala Pro Thr Val Leu Phe Gln Ala Ser Lys Asp Asp Tyr Trp 
115 120 125 

Thr Ser Thr Lys Ala Asn Ile Glu Ser Leu Ala Ser Ala Phe Gly Glu 
130 135 140 

Thr Gly Thr Gln Lys Ala Lys Glu Glu Leu Thr Lys Leu Asp Lys Ser 
145 150 155 160 

Ile Gln Glu Val Ala Thr Lys Asn Glu Ser Ser Asp Lys Lys Ala Leu 
165 170 175 

Ala Ile Leu Leu Asn Glu Gly Lys Met Ala Ala Phe Gly Ala Lys Ser 
180 185 190 

Arg Phe Ser Phe Leu Tyr Gln Thr Leu Lys Phe Lys Pro Thr Asp Thr 
195 200 205 

Lys Phe Glu Asp Ser Arg His Gly Gln Glu Val Ser Phe Glu Ser Val 
210 215 220 

Lys Glu Ile Asn Pro Asp Ile Leu Phe Val Ile Asn Arg Thr Leu Ala 
225 230 235 240 

Ile Gly Gly Asp Asn Ser Ser Asn Asp Gly Val Leu Glu Asn Ala Leu 
245 250 255 

Ile Ala Glu Thr Pro Ala Ala Lys Asn Gly Lys Ile Ile Gln Leu Thr 
260 265 270 

Pro Asp Leu Trp Tyr Leu Ser Gly Gly Gly Leu Glu Ser Thr Lys Leu 
275 280 285 

Met Ile Glu Asp Ile Gln Lys Ala Leu Lys 
290 295 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TGGCAATTCT GGCGGAAGTA AAGATGCTGC CAAATCAGGT GGTGACGGTG CCAAAACAGA 60 

AATCACTTGG TGGGCATTCC CAGTATTTAC CCAAGAAAAA ACTGGTGACG GTGTTGGAAC 120 

TTATGAAAAA TCAATCATCG AAGCGTTTGA AAAAGCAAAC CCAGATATAA AAGTGAAATT 180 

GGAAACCATC GACTTCAAGT CAGGTCCTGA AAAAATCACA ACAGCCATCG AAGCAGGAAC 240 

AGCTCCAGAC GTACTCTTTG ATGCACCAGG ACGTATCATC CAATACGGTA AAAACGGTAA 300 

ATTGGCTGAG TTGAATGACC TCTTCACAGA TGAATTTGTT AAAGATGTCA ACAATGAAAA 360 

CATCGTACAA GCAAGTAAAG CTGGAGACAA GGCTTATATG TATCCGATTA GTTCTGCCCC 420 

ATTCTACATG GCAATGAACA AGAAAATGTT AGAAGATGCT GGAGTAGCAA ACCTTGTAAA 480 

AGAAGGTTGG ACAACTGATG ATTTTGAAAA AGTATTGAAA GCACTTAAAG ACAAGGGTTA 540 

CACACCAGGT TCATTGTTCA GTTCTGGTCA AGGGGGAGAC CAAGGAACAC GTGCCTTTAT 600 

CTCTAACCTT TATAGCGGTT CTGTAACAGA TGAAAAAGTT AGCAAATATA CAACTGATGA 660 

TCCTAAATTC GTCAAAGGTC TTGAAAAAGC AACTAGCTGG ATTAAAGACA ATTTGATCAA 720 

TAATGGTTCA CAATTTGACG GTGGGGCAGA TATCCAAAAC TTTGCCAACG GTCAAACATC 780 

TTACACAATC CTTTGGGCAC CAGCTCAAAA TGGTATCCAA GCTAAACTTT TAGAAGCAAG 840 

TAAGGTAGAA GTGGTAGAAG TACCATTCCC ATCAGACGAA GGTAAGCCAG CTCTTGAGTA 900 

CCTTGTAAAC GGGTTTGCAG TATTCAACAA TAAAGACGAC AAGAAAGTCG CTGCATCTAA 960 

GAAATTCATC CAGTTTATCG CAGATGACAA GGAGTGGGGA CCTAAAGACG TAGTTCGTAC 1020 

AGGTGCTTTC CCAGTCCGTA CTTCATTTGG AAAACTTTAT GAAGACAAAC GCATGGAAAC 1080 

AATCAGCGGC TGGACTCAAT ACTACTCACC ATACTACAAC ACTATTGATG GATTTGCTGA 1140 

AATGAGAACA CTTTGGTTCC CAATGTTGCA ATCTGTATCA AATGGTGACG AAAAACCAGC 1200 

AGATGCTTTG AAAGCCTTCA CTGAAAAAGC GAACGAAACA ATCAAAAAAG CTATGAAACA 1260 

A 1261 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala Lys Ser Gly Gly Asp Gly 
1 5 10 15 

Ala Lys Thr Glu Ile Thr Trp Trp Ala Phe Pro Val Phe Thr Gln Glu 
20 25 30 

Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu Lys Ser Ile Ile Glu Ala 
35 40 45 

Phe Glu Lys Ala Asn Pro Asp Ile Lys Val Lys Leu Glu Thr Ile Asp 
50 55 60 

Phe Lys Ser Gly Pro Glu Lys Ile Thr Thr Ala Ile Glu Ala Gly Thr 
65 70 75 80 

Ala Pro Asp Val Leu Phe Asp Ala Pro Gly Arg Ile Ile Gln Tyr Gly 
85 90 95 

Lys Asn Gly Lys Leu Ala Glu Leu Asn Asp Leu Phe Thr Asp Glu Phe 
100 105 110 

Val Lys Asp Val Asn Asn Glu Asn Ile Val Gln Ala Ser Lys Ala Gly 
115 120 125 

Asp Lys Ala Tyr Met Tyr Pro Ile Ser Ser Ala Pro Phe Tyr Met Ala 
130 135 140 

Met Asn Lys Lys Met Leu Glu Asp Ala Gly Val Ala Asn Leu Val Lys 
145 150 155 160 

Glu Gly Trp Thr Thr Asp Asp Phe Glu Lys Val Leu Lys Ala Leu Lys 
165 170 175 

Asp Lys Gly Tyr Thr Pro Gly Ser Leu Phe Ser Ser Gly Gln Gly Gly 
180 185 190 

Asp Gln Gly Thr Arg Ala Phe Ile Ser Asn Leu Tyr Ser Gly Ser Val 
195 200 205 

Thr Asp Glu Lys Val Ser Lys Tyr Thr Thr Asp Asp Pro Lys Phe Val 
210 215 220 

Lys Gly Leu Glu Lys Ala Thr Ser Trp Ile Lys Asp Asn Leu Ile Asn 
225 230 235 240 

Asn Gly Ser Gln Phe Asp Gly Gly Ala Asp Ile Gln Asn Phe Ala Asn 
245 250 255 

Gly Gln Thr Ser Tyr Thr Ile Leu Trp Ala Pro Ala Gln Asn Gly Ile 
260 265 270 

Gln Ala Lys Leu Leu Glu Ala Ser Lys Val Glu Val Val Glu Val Pro 
275 280 285 

Phe Pro Ser Asp Glu Gly Lys Pro Ala Leu Glu Tyr Leu Val Asn Gly 
290 295 300 

Phe Ala Val Phe Asn Asn Lys Asp Asp Lys Lys Val Ala Ala Ser Lys 
305 310 315 320 

Lys Phe Ile Gln Phe Ile Ala Asp Asp Lys Glu Trp Gly Pro Lys Asp 
325 330 335 

Val Val Arg Thr Gly Ala Phe Pro Val Arg Thr Ser Phe Gly Lys Leu 
340 345 350 

Tyr Glu Asp Lys Arg Met Glu Thr Ile Ser Gly Trp Thr Gln Tyr Tyr 
355 360 365 

Ser Pro Tyr Tyr Asn Thr Ile Asp Gly Phe Ala Glu Met Arg Thr Leu 
370 375 380 

Trp Phe Pro Met Leu Gln Ser Val Ser Asn Gly Asp Glu Lys Pro Ala 
385 390 395 400 

Asp Ala Leu Lys Ala Phe Thr Glu Lys Ala Asn Glu Thr Ile Lys Lys 
405 410 415 

Ala Met Lys Gln 
420 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTCACAAGAA AAAACAAAAA ATGAAGATGG AGAAACTAAG ACAGAACAGA CAGCCAAAGC 60 

TGATGGAACA GTCGGTAGTA AGTCTCAAGG AGCTGCCCAG AAGAAAGCAG AAGTGGTCAA 120 

TAAAGGTGAT TACTACAGCA TTCAAGGGAA ATACGATGAA ATCATCGTAG CCAACAAACA 180 

CTATCCATTG TCTAAAGACT ATAATCCAGG GGAAAATCCA ACAGCCAAGG CAGAGTTGGT 240 

CAAACTCATC AAAGCGATGC AAGAGGCAGG TTTCCCTATT AGTGATCATT ACAGTGGTTT 300 

TAGAAGTTAT GAAACTCAGA CCAAGCTCTA TCAAGATTAT GTCAACCAAG ATGGAAAGGC 360 

AGCAGCTGAC CGTTACTCTG CCCGTCCTGG CTATAGCGAA CACCAGACAG GCTTGGCCTT 420 

TGATGTGATT GGGACTGATG GTGATTTGGT GACAGAAGAA AAAGCAGCCC AATGGCTCTT 480 

GGATCATGCA GCTGATTATG GCTTTGTTGT CCGTTATCTC AAAGGCAAGG AAAAGGAAAC 540 

AGGCTATATG GCTGAAGAAT GGCACCTGCG TTATGTAGGA AAAGAAGCTA AAGAAATTGC 600 

TGCAAGTGGT CTCAGTTTGG AAGAATACTA TGGCTTTGAA GGCGGAGACT ACGTCGAT 658 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Gln Glu Lys Thr Lys Asn Glu Asp Gly Glu Thr Lys Thr Glu Gln 
1 5 10 15 

Thr Ala Lys Ala Asp Gly Thr Val Gly Ser Lys Ser Gln Gly Ala Ala 
20 25 30 

Gln Lys Lys Ala Glu Val Val Asn Lys Gly Asp Tyr Tyr Ser Ile Gln 
35 40 45 

Gly Lys Tyr Asp Glu Ile Ile Val Ala Asn Lys His Tyr Pro Leu Ser 
50 55 60 

Lys Asp Tyr Asn Pro Gly Glu Asn Pro Thr Ala Lys Ala Glu Leu Val 
65 70 75 80 

Lys Leu Ile Lys Ala Met Gln Glu Ala Gly Phe Pro Ile Ser Asp His 
85 90 95 

Tyr Ser Gly Phe Arg Ser Tyr Glu Thr Gln Thr Lys Leu Tyr Gln Asp 
100 105 110 

Tyr Val Asn Gln Asp Gly Lys Ala Ala Ala Asp Arg Tyr Ser Ala Arg 
115 120 125 

Pro Gly Tyr Ser Glu His Gln Thr Gly Leu Ala Phe Asp Val Ile Gly 
130 135 140 

Thr Asp Gly Asp Leu Val Thr Glu Glu Lys Ala Ala Gln Trp Leu Leu 
145 150 155 160 

Asp His Ala Ala Asp Tyr Gly Phe Val Val Arg Tyr Leu Lys Gly Lys 
165 170 175 

Glu Lys Glu Thr Gly Tyr Met Ala Glu Glu Trp His Leu Arg Tyr Val 
180 185 190 

Gly Lys Glu Ala Lys Glu Ile Ala Ala Ser Gly Leu Ser Leu Glu Glu 
195 200 205 

Tyr Tyr Gly Phe Glu Gly Gly Asp Tyr Val Asp 
210 215 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GAAAGGTCTG TGGTCAAATA ATCTTACCTG CGGTTATGAT GAAAAAATAA TCTTGGAAAA 60 

TATAAATATA AAAATACCTG AAGAAAAAAT ATCAGTTATT ATTGGGTCAA ATGGTTGTGG 120 

GAAATCAACA CTCATTAAAA CCTTGTCTCG ACTTATAAAG CCATTAGAGG GAGAAGTATT 180 

GCTTGATAAT AAATCAATTA ATTCTTATAA AGAAAAAGAT TTAGCAAAAC ACATAGCTAT 240 

ATTACCTCAA TCTCCAATAA TCCCTGAATC AATAACAGTA GCTGATCTTG TAAGCCGTGG 300 

TCGTTTCCCC TACAGAAAGC CTTTTAAGAG TCTTGGAAAA GATGACCTTG AAATAATAAA 360 

CAGATCAATG GTTAAGGCCA ATGTTGAAGA TCTAGCAAAT AACCTAGTTG AAGAACTTTC 420 

TGGGGGTCAA AGGCAAAGAG TATGGATAGC TCTAGCCCTA GCCCAAGATA CAAGTATCCT 480 

ACTTTTAGAT GAGCCAACTA CTTACTTGGA TATCTCATAT CAAATAGAAC TATTAGACCT 540 

CTTGACTGAT CTAAACCAAA AATATAAGAC AACCATTTGC ATGATTTTGC ACGATATAAA 600 

TCTAACAGCA AGATACGCTG ATTACCTATT TGCAATTAAA GAAGGTAAAC TTGTTGCAGA 660 

GGGAAAGCCT GAAGATATAC TAAATGATAA ACTAGTTAAA GATATCTTTA ATCTTGAAGC 720 

AAAAATTATA CGTGACCCTA TTTCCAATTC GCCTCTAATG ATTCCTATTG GCAAGCACCA 780 

TGTTAACTCT 790 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Gly Leu Trp Ser Asn Asn Leu Thr Cys Gly Tyr Asp Glu Lys Ile 
1 5 10 15 

Ile Leu Glu Asn Ile Asn Ile Lys Ile Pro Glu Glu Lys Ile Ser Val 
20 25 30 

Ile Ile Gly Ser Asn Gly Cys Gly Lys Ser Thr Leu Ile Lys Thr Leu 
35 40 45 

Ser Arg Leu Ile Lys Pro Leu Glu Gly Glu Val Leu Leu Asp Asn Lys 
50 55 60 

Ser Ile Asn Ser Tyr Lys Glu Lys Asp Leu Ala Lys His Ile Ala Ile 
65 70 75 80 

Leu Pro Gln Ser Pro Ile Ile Pro Glu Ser Ile Thr Val Ala Asp Leu 
85 90 95 

Val Ser Arg Gly Arg Phe Pro Tyr Arg Lys Pro Phe Lys Ser Leu Gly 
100 105 110 

Lys Asp Asp Leu Glu Ile Ile Asn Arg Ser Met Val Lys Ala Asn Val 
115 120 125 

Glu Asp Leu Ala Asn Asn Leu Val Glu Glu Leu Ser Gly Gly Gln Arg 
130 135 140 

Gln Arg Val Trp Ile Ala Leu Ala Leu Ala Gln Asp Thr Ser Ile Leu 
145 150 155 160 

Leu Leu Asp Glu Pro Thr Thr Tyr Leu Asp Ile Ser Tyr Gln Ile Glu 
165 170 175 

Leu Leu Asp Leu Leu Thr Asp Leu Asn Gln Lys Tyr Lys Thr Thr Ile 
180 185 190 

Cys Met Ile Leu His Asp Ile Asn Leu Thr Ala Arg Tyr Ala Asp Tyr 
195 200 205 

Leu Phe Ala Ile Lys Glu Gly Lys Leu Val Ala Glu Gly Lys Pro Glu 
210 215 220 

Asp Ile Leu Asn Asp Lys Leu Val Lys Asp Ile Phe Asn Leu Glu Ala 
225 230 235 240 

Lys Ile Ile Arg Asp Pro Ile Ser Asn Ser Pro Leu Met Ile Pro Ile 
245 250 255 

Gly Lys His His Val Ser 
260 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AAACTCAGAA AAGAAAGCAG ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG 60 

CGGTTCTGAA GAAAAACGTT GGGACAAAAT CCAAGAATTG GTTAAAAAAG ACGGAATTAC 120 

CTTGGAATTT ACAGAGTTCA CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA 180 

AGTAGATTTG AACGCTTTCC AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG 240 

AAAAGACCTT GTAGCGATTG CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT 300 

GAATGGAAGT GCCAACAAGT ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC 360 

TGTACCGAAT GACGCTACAA ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT 420 

GATTAAATTG GATGTTTCTG GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC 480 

AAAGAACTTG AAAATCACTG AATTGGACGC TAGCCAAACA GCTCGTTCAT TGTCATCAGT 540 

TGACGCTGCC GTTGTAAACA ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC 600 

ACTTTTCAAA GAACAAGCTG ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA 660 

AAAAGATTGG GAAACATCAC CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA 720 

CACAGATGAC GTGAAAAAAG TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG 780 

G 781 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Asn Ser Glu Lys Lys Ala Asp Asn Ala Thr Thr Ile Lys Ile Ala Thr 
1 5 10 15 

Val Asn Arg Ser Gly Ser Glu Glu Lys Arg Trp Asp Lys Ile Gln Glu 
20 25 30 

Leu Val Lys Lys Asp Gly Ile Thr Leu Glu Phe Thr Glu Phe Thr Asp 
35 40 45 

Tyr Ser Gln Pro Asn Lys Ala Thr Ala Asp Gly Glu Val Asp Leu Asn 
50 55 60 

Ala Phe Gln His Tyr Asn Phe Leu Asn Asn Trp Asn Lys Glu Asn Gly 
65 70 75 80 

Lys Asp Leu Val Ala Ile Ala Asp Thr Tyr Ile Ser Pro Ile Arg Leu 
85 90 95 

Tyr Ser Gly Leu Asn Gly Ser Ala Asn Lys Tyr Thr Lys Val Glu Asp 
100 105 110 

Ile Pro Ala Asn Gly Glu Ile Ala Val Pro Asn Asp Ala Thr Asn Glu 
115 120 125 

Ser Arg Ala Leu Tyr Leu Leu Gln Ser Ala Gly Leu Ile Lys Leu Asp 
130 135 140 

Val Ser Gly Thr Ala Leu Ala Thr Val Ala Asn Ile Lys Glu Asn Pro 
145 150 155 160 

Lys Asn Leu Lys Ile Thr Glu Leu Asp Ala Ser Gln Thr Ala Arg Ser 
165 170 175 

Leu Ser Ser Val Asp Ala Ala Val Val Asn Asn Thr Phe Val Thr Glu 
180 185 190 

Ala Lys Leu Asp Tyr Lys Lys Ser Leu Phe Lys Glu Gln Ala Asp Glu 
195 200 205 

Asn Ser Lys Gln Trp Tyr Asn Ile Ile Val Ala Lys Lys Asp Trp Glu 
210 215 220 

Thr Ser Pro Lys Ala Asp Ala Ile Lys Lys Val Ile Ala Ala Tyr His 
225 230 235 240 

Thr Asp Asp Val Lys Lys Val Ile Glu Glu Ser Ser Asp Gly Leu Asp 
245 250 255 

Gln Pro Val Trp 
260 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TTCGAAAGGG TCAGAAGGTG CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA 60 

TCAATTTTAT GAGCAAGTGA AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC 120 

CATCCAAAAA GTTTTTGAAA AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA 180 

TACTATTGCC GAAGAAAAAA AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC 240 

AGGTATGACT CTTGAAACAC GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC 300 

AGTTAAGAAG GTAGCAGAAG CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA 360 

GTACACTCCA GATGTAACGG CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA 420 

AGTTCTCGAA AAAGCCAAGG CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC 480 

AACTGATGAA AAAACAAAAG AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA 540 

AGTACCTGGA GCAAGTCCAA AAAAGCCGCT TTTCGCTTTT AGATGTGGGA TGGTGTTTCT 600 

GGATGTGGAT TACAGCAACT GGGGCACACC AAGCCTACAG 640 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ser Lys Gly Ser Glu Gly Ala Asp Leu Ile Ser Met Lys Gly Asp Val 
1 5 10 15 

Ile Thr Glu His Gln Phe Tyr Glu Gln Val Lys Ser Asn Pro Ser Ala 
20 25 30 

Gln Gln Val Leu Leu Asn Met Thr Ile Gln Lys Val Phe Glu Lys Gln 
35 40 45 

Tyr Gly Ser Glu Leu Asp Asp Lys Glu Val Asp Asp Thr Ile Ala Glu 
50 55 60 

Glu Lys Lys Gln Tyr Gly Glu Asn Tyr Gln Arg Val Leu Ser Gln Ala 
65 70 75 80 

Gly Met Thr Leu Glu Thr Arg Lys Ala Gln Ile Arg Thr Ser Lys Leu 
85 90 95 

Val Glu Leu Ala Val Lys Lys Val Ala Glu Ala Glu Leu Thr Asp Glu 
100 105 110 

Ala Tyr Lys Lys Ala Phe Asp Glu Tyr Thr Pro Asp Val Thr Ala Gln 
115 120 125 

Ile Ile Arg Leu Asn Asn Glu Asp Lys Ala Lys Glu Val Leu Glu Lys 
130 135 140 

Ala Lys Ala Glu Gly Ala Asp Phe Ala Gln Leu Ala Lys Asp Asn Ser 
145 150 155 160 

Thr Asp Glu Lys Thr Lys Glu Asn Gly Gly Glu Ile Thr Phe Asp Ser 
165 170 175 

Ala Ser Thr Glu Val Pro Gly Ala Ser Pro Lys Lys Pro Leu Phe Ala 
180 185 190 

Phe Arg Cys Gly Met Val Phe Leu Asp Val Asp Tyr Ser Asn Trp Gly 
195 200 205 

Thr Pro Ser Leu Gln 
210 

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA TTGCTCAAAC 60

TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG CTGTTCAGTG 120

TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG TTGTAGAAAA 180

ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG AATCTAATGG 240

TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA CAGATGGTAA 300

ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT TTTCTGGTAG 360

CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG ATGGTAGCAG 420

ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG AAAATGGCAC 480

TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG TCGGACCACA 540

CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC CAGATGGTTA 600

CCGTGTAAAT GGTAATGGTG AATGGGTAAA C 631

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Gly Met Ala Ala Phe Lys Asn Pro Asn Asn Gln Tyr Lys Ala Ile Thr
1 5 10 15

Ile Ala Gln Thr Leu Gly Asp Asp Ala Ser Ser Glu Glu Leu Ala Gly
20 25 30

Arg Tyr Gly Ser Ala Val Gln Cys Thr Glu Val Thr Ala Ser Asn Leu
35 40 45

Ser Thr Val Lys Thr Lys Ala Thr Val Val Glu Lys Pro Leu Lys Asp
50 55 60

Phe Arg Ala Ser Thr Ser Asp Gln Ser Gly Trp Val Glu Ser Asn Gly
65 70 75 80

Lys Trp Tyr Phe Tyr Glu Ser Gly Asp Val Lys Thr Gly Trp Val Lys
85 90 95

Thr Asp Gly Lys Trp Tyr Tyr Leu Asn Asp Leu Gly Val Met Gln Thr
100 105 110 

Gly Phe Val Lys Phe Ser Gly Ser Trp Tyr Tyr Leu Ser Asn Ser Gly 
115 120 125 

Ala Met Phe Thr Gly Trp Gly Thr Asp Gly Ser Arg Trp Phe Tyr Phe 
130 135 140 

Asp Gly Ser Gly Ala Met Lys Thr Gly Trp Tyr Lys Glu Asn Gly Thr 
145 150 155 160 

Trp Tyr Tyr Leu Asp Glu Ala Gly Ile Met Lys Thr Gly Trp Phe Lys
165 170 175 

Val Gly Pro His Trp Tyr Tyr Ala Tyr Gly Ser Gly Ala Leu Ala Val 
180 185 190 

Ser Thr Thr Thr Pro Asp Gly Tyr Arg Val Asn Gly Asn Gly Glu Trp 
195 200 205 

Val Asn 
210 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



AGACGAGCAA AAAATTAAGC AAGCAGAAGC GGAAGTTGAG AGTAAACAAG CTGAGGCTAC 60

AAGGTTAAAA AAAATCAAGA CAGATCGTGA AGAAGCAGAA GAAGAAGCTA AACGAAGAGC 120

AGATGCTAAA GAGCAAGGTA AACCAAAGGG GCGGGCAAAA CGAGGAGTTC CTGGAGAGCT 180

AGCAACACCT GATAAAAAAG AAAATGATGC GAAGTCTTCA GATTCTAGCG TAGGTGAAGA 240

AACTCTTCCA AGCCCATCCC TGAAACCAGA AAAAAAGGTA GCAGAAGCTG AGAAGAAGGT 300

TGAAGAAGCT AAGAAAAAAG CCGAGGATCA AAAAGAAGAA GATCGCCGTA ACTACCCAAC 360

CAATACTTAC AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TTAAAAAAGC 420

GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA GGAACCTCGA AACGAGGAAA AAGTTAAGCA 480

AGCAAAAGCG GAAGTTGAGA GTAAAAAAGC TGAGGCTACA AGGTTAGAAA AAATCAAGAC 540

AGATCGTAAA AAAGCAGAAG AAGAAGCTAA ACGAAAAGCA GCAGAAGAAG ATAAAGTTAA 600

AGAAAAACCA GCTGAACAAC CACAACCAGC GCCGGCTCCA AAAGCAGAAA AACCAGCTCC 660

AGCTCCAAAA CCAGAGAATC CAGCTGAACA ACCAAAAGCA GAAAAACCAG CTGATCAACA 720

AGCTGAAGAA GACTATGCTC GTAGATCAGA AGAAGAATAT AATCGCTTGA CTCAACAGCA 780

ACCGCCAAAA ACTGAAAAAC CAGCACAACC ATCTACTCCA AAAACAGGCT GGAAACAAGA 840

AAACGGTATG TGGTACTTCT ACAATACTGA TGGTTCAATG GCGACAGGAT GGCTCCAAAA 900

CAATGGCTCA TGGTACTACC TCAACAGCAA TGGCGCTATG GCGACAGGAT GGCTCCAAAA 960

CAATGGTTCA TGGTACTATC TAAACGCTAA TGGTTCAATG GCAACAGGAT GGCTCCAAAA 1020

CAATGGTTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GGCTCCAATA 1080

CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GGCTCCAATA 1140

CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTGATATG GCGACAGGTT GGGTGAAAGA 1200

TGGAGATACC TGGTACTATC TTGAAGCATC AGGTGCTATG AAAGCAAGCC AATGGTTCAA 1260

AGTATCAGAT AAATGGTACT ATGTCAATGG CTCAGGTGCC CTTGCAGTCA ACACAACTGT 1320

AGATGGCTAT GGAGTCAATG CCAATGGTGA ATGGGTAAAC 1360

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Asp Glu Gln Lys Ile Lys Gln Ala Glu Ala Glu Val Glu Ser Lys Gln
1 5 10 15

Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala
20 25 30

Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro
35 40 45

Lys Gly Arg Ala Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp
50 55 60

Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu
65 70 75 80

Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala
85 90 95

Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu
100 105 110

Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu
115 120 125

Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu
130 135 140

Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gln
145 150 155 160

Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu
165 170 175

Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys
180 185 190

Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln
195 200 205

Pro Ala Pro Ala Pro Lys Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro
210 215 220

Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu Lys Pro Ala Asp Gln Gln
225 230 235 240

Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu
245 250 255

Thr Gln Gln Gln Pro Pro Lys Thr Glu Lys Pro Ala Gln Pro Ser Thr
260 265 270

Pro Lys Thr Gly Trp Lys Gln Glu Asn Gly Met Trp Tyr Phe Tyr Asn
275 280 285

Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp
290 295 300

Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gln Asn
305 310 315 320

Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala Thr Gly
325 330 335

Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser
340 345 350

Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn
355 360 365

Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Ser Trp
370 375 380

Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val Lys Asp
385 390 395 400

Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala Ser
405 410 415

Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Ser Gly
420 425 430

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn Ala Asn
435 440 445

Gly Glu Trp Val Asn
450

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CTGTGGTGAG GAAGAAACTA AAAAGACTCA AGCAGCACAA CAGCCAAAAC AACAAACGAC 60

TGTACAACAA ATTGCTGTTG GAAAAGATGC TCCAGACTTC ACATTGCAAT CCATGGATGG 120

CAAAGAAGTT AAGTTATCTG ATTTTAAGGG TAAAAAGGTT TACTTGAAGT TTTGGGCTTC 180

ATGGTGTGGT CCATGCAAGA AAAGTATGCC AGAGTTGATG GAACTAGCGG CGAAACCAGA 240

TCGTGATTTC GAAATTCTTA CTGTCATTGC ACCAGGAATT CAAGGTGAAA AAACTGTTGA 300

GCAATTCCCA CAATGGTTCC AGGAACAAGG ATATAAGGAT ATCCCAGTTC TTTATGATAC 360

CAAAGCAACC ACTTCCAAGC TTATCAAATT CGAAGCATTC CTACAGAATA TT 412

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Gly Glu Glu Glu Thr Lys Lys Thr Gln Ala Ala Gln Gln Pro Lys
1 5 10 15

Gln Gln Thr Thr Val Gln Gln Ile Ala Val Gly Lys Asp Ala Pro Asp
20 25 30

Phe Thr Leu Gln Ser Met Asp Gly Lys Glu Val Lys Leu Ser Asp Phe
35 40 45

Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp Cys Gly Pro
50 55 60

Cys Lys Lys Ser Met Pro Glu Leu Met Glu Leu Ala Ala Lys Pro Asp
65 70 75 80

Arg Asp Phe Glu Ile Leu Thr Val Ile Ala Pro Gly Ile Gln Gly Glu
85 90 95

Lys Thr Val Glu Gln Phe Pro Gln Trp Phe Gln Glu Gln Gly Tyr Lys
100 105 110

Asp Ile Pro Val Leu Tyr Asp Thr Lys Ala Thr Thr Ser Lys Leu Ile
115 120 125

Lys Phe Glu Ala Phe Leu Gln Asn Ile
130 135

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:



GACTTTTAAC AATAAAACTA TTGAAGAGTT GCACAATCTC CTTGTCTCTA AGGAAATTTC 60

TGCAACAGAA TTGACCCAAG CAACACTTGA AAATATCAAG TCTCGTGAGG AAGCCCTCAA 120

TTCATTTGTC ACCATCGCTG AGGAGCAAGC TCTTGTTCAA GCTAAAGCCA TTGATGAAGC 180

tGGAATTGAT GCTGACAATG TCCTTTCAGG AATTCCACTT GCTGTTAAGG ATAACATCTC 240

TACAGACGGT ATTCTCACAA CTGCTGCCTC AAAAATGCTC TACAACTATG AGCCAATCTT 300

TGATGCGACa gCTgTTGCCA ATGCAAAAAC CAAGGGCATG ATTGTCGTTG GAAAGACCAA 360

CATGGACGAA TTTGCTATGG GTGGTTCAGG tGAAACTTCA CACTACGGAG CAACTAAAAA 420

CGCTTGGAAC CACAGCAAGG TTCCTGGTGG GTCATCAAGT GGTTCTGCCG CAGCTGTAGC 480

CTCAGGACAA GTTCGCTTGT CACTTGGTTC TGATACTGGT GGTTCCATCC GCCAACCTGC 540

TGCCTTCAAC GGAATCGTTG GTCTCAAACC AACCTACGGA ACAGTTTCAC GTTTCGGTCT 600

CATTGCCTTT GGTAGCTCAT TAGACCAGAT TGGACCTTTT GCTCCTACTG TTAAGGAAAA 660

TGCCCTCTTG CTCAACGCTA TTGCCAGCGA AGATGCTAAA GACTCTACTT CTGCTCCTGT 720

CCGCATCGCC GACTTTACTT CAAAAATCGG CCAAGACATC AAGGGTATGA AAATCGCTTT 780

GCCTAAGGAA TACCTAGGCG AAGGAATTGA TCCAGAGGTT AAGGAAACAA TCTTAAACGC 840

GGCCAAACAC TTTGAAAAAT TGGGTGCTAT CGTCGAAGAA GTCAGCCTTC CTCACTCTAA 900

ATACGGTGTT GCCGTTTATT ACATCATCGC TTCATCAGAA GCTTCATCAA ACTTGCAACG 960

CTTCGACGGT ATCCGTTACG GCTATCGCGC AGAAGATGCA ACCAACCTTG ATGAAATCTA 1020

TGTAAACAGC CGAAGCCAAG GTTTTGGTGA AGAGGTAAAA CGTCGTATCA TGCTGGGTAC 1080

TTTCAGTCTT TCATCAGGTT ACTATGATGC CTACTACAAA AAGGCTGGTC AAGTCCGTAC 1140

CCTCATCATT CAAGATTTCG AAAAAGTCTT CGCGGATTAC GATTTGATTT TGGGTCCAAC 1200

TGCTCCAAGT GTTGCCTATG ACTTGGATTC TCTCAACCAT GACCCAGTTG CCATGTACTT 1260

AGCCGACCTA TTGACCATAC CTGTAAACTT GGCAGGACTG CCTGGAATTT CGATTCCTGC 1320

TGGATTCTCT CAAGGTCTAC CTGTCGGACT CCAATTGATT GGTCCCAAGT ACTCTGAGGA 1380

AACCATTTAC CAAGCTGCTG CTGCTTTTGA AGCAACAACA GACTACCACA AACAACAACC 1440

CGTGATTTTT GGAGGTGACA AC 1462


(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Thr Phe Asn Asn Lys Thr Ile Glu Glu Leu His Asn Leu Leu Val Ser
1 5 10 15

Lys Glu Ile Ser Ala Thr Glu Leu Thr Gln Ala Thr Leu Glu Asn Ile
20 25 30

Lys Ser Arg Glu Glu Ala Leu Asn Ser Phe Val Thr Ile Ala Glu Glu
35 40 45

Gln Ala Leu Val Gln Ala Lys Ala Ile Asp Glu Ala Gly Ile Asp Ala
50 55 60

Asp Asn Val Leu Ser Gly Ile Pro Leu Ala Val Lys Asp Asn Ile Ser
65 70 75 80

Thr Asp Gly Ile Leu Thr Thr Ala Ala Ser Lys Met Leu Tyr Asn Tyr
85 90 95

Glu Pro Ile Phe Asp Ala Thr Ala Val Ala Asn Ala Lys Thr Lys Gly
100 105 110

Met Ile Val Val Gly Lys Thr Asn Met Asp Glu Phe Ala Met Gly Gly
115 120 125

Ser Gly Glu Thr Ser His Tyr Gly Ala Thr Lys Asn Ala Trp Asn His
130 135 140

Ser Lys Val Pro Gly Gly Ser Ser Ser Gly Ser Ala Ala Ala Val Ala
145 150 155 160

Ser Gly Gln Val Arg Leu Ser Leu Gly Ser Asp Thr Gly Gly Ser Ile
165 170 175

Arg Gln Pro Ala Ala Phe Asn Gly Ile Val Gly Leu Lys Pro Thr Tyr
180 185 190

Gly Thr Val Ser Arg Phe Gly Leu Ile Ala Phe Gly Ser Ser Leu Asp
195 200 205

Gln Ile Gly Pro Phe Ala Pro Thr Val Lys Glu Asn Ala Leu Leu Leu
210 215 220

Asn Ala Ile Ala Ser Glu Asp Ala Lys Asp Ser Thr Ser Ala Pro Val
225 230 235 240

Arg Ile Ala Asp Phe Thr Ser Lys Ile Gly Gln Asp Ile Lys Gly Met
245 250 255

Lys Ile Ala Leu Pro Lys Glu Tyr Leu Gly Glu Gly Ile Asp Pro Glu
260 265 270

Val Lys Glu Thr Ile Leu Asn Ala Ala Lys His Phe Glu Lys Leu Gly
275 280 285

Ala Ile Val Glu Glu Val Ser Leu Pro His Ser Lys Tyr Gly Val Ala
290 295 300

Val Tyr Tyr Ile Ile Ala Ser Ser Glu Ala Ser Ser Asn Leu Gln Arg
305 310 315 320

Phe Asp Gly Ile Arg Tyr Gly Tyr Arg Ala Glu Asp Ala Thr Asn Leu
325 330 335

Asp Glu Ile Tyr Val Asn Ser Arg Ser Gln Gly Phe Gly Glu Glu Val
340 345 350

Lys Arg Arg Ile Met Leu Gly Thr Phe Ser Leu Ser Ser Gly Tyr Tyr
355 360 365

Asp Ala Tyr Tyr Lys Lys Ala Gly Gln Val Arg Thr Leu Ile Ile Gln
370 375 380

Asp Phe Glu Lys Val Phe Ala Asp Tyr Asp Leu Ile Leu Gly Pro Thr
385 390 395 400

Ala Pro Ser Val Ala Tyr Asp Leu Asp Ser Leu Asn His Asp Pro Val
405 410 415

Ala Met Tyr Leu Ala Asp Leu Leu Thr Ile Pro Val Asn Leu Ala Gly
420 425 430

Leu Pro Gly Ile Ser Ile Pro Ala Gly Phe Ser Gln Gly Leu Pro Val
435 440 445

Gly Leu Gln Leu Ile Gly Pro Lys Tyr Ser Glu Glu Thr Ile Tyr Gln
450 455 460

Ala Ala Ala Ala Phe Glu Ala Thr Thr Asp Tyr His Lys Gln Gln Pro
465 470 475 480

Val Ile Phe Gly Gly Asp Asn
485

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CTTTACAGGT AAACAACTAC AAGTCGGCGA CAAGGCGCTT GATTTTTCTC TTACTACAAC 60 

AGATCTTTCT AAAAAATCTC TGGCTGATTT TGATGGCAAG AAAAAAGTCT TGAGTGTCGT 120

TCCTTCTATC GATACAGGCA TCTGCTCAAC TCAAACACGT CGTTTTAATG AAGAATTGGC 180 

TGGACTGGAC AACACGGTCG TATTGACTGT TTCAATGGAC CTACCTTTTG CTCAAAAACG 240 

TTGGTGCGGT GCTGAAGGCC TTGACAATGC CATTATGCTT TCAGACTACT TTGACCATTC 300 

TTTCGGGCGC GATTATGCCC TCTTGATCAA CGAATGGCAC CTATTAGCAC GCGCAGTCTT 360

TGTCCTCGAT ACTGACAATA CGATTCGCTA CGTTGAATAC GTGGATAATA TCAATTCTGA 420

GCCAAACTTC GAA 433

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Phe Thr Gly Lys Gln Leu Gln Val Gly Asp Lys Ala Leu Asp Phe Ser
1 5 10 15

Leu Thr Thr Thr Asp Leu Ser Lys Lys Ser Leu Ala Asp Phe Asp Gly
20 25 30

Lys Lys Lys Val Leu Ser Val Val Pro Ser Ile Asp Thr Gly Ile Cys
35 40 45

Ser Thr Gln Thr Arg Arg Phe Asn Glu Glu Leu Ala Gly Leu Asp Asn
50 55 60

Thr Val Val Leu Thr Val Ser Met Asp Leu Pro Phe Ala Gln Lys Arg
65 70 75 80

Trp Cys Gly Ala Glu Gly Leu Asp Asn Ala Ile Met Leu Ser Asp Tyr
85 90 95

Phe Asp His Ser Phe Gly Arg Asp Tyr Ala Leu Leu Ile Asn Glu Trp
100 105 110

His Leu Leu Ala Arg Ala Val Phe Val Leu Asp Thr Asp Asn Thr Ile
115 120 125

Arg Tyr Val Glu Tyr Val Asp Asn Ile Asn Ser Glu Pro Asn Phe Glu
130 135 140

(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CCAGGCTGAT ACAAGTATCG CAGACATTCA AAAAAGAGGC GAACTGGTTG TCGGTGTCAA 60

ACAAGACGTT CCCAATTTTG GTTACAAnGA TCCCAAGACC GGTACTTATT CTGGTATCGA 120

AaCCGACTTG GCCAAGATGG TAGCTGATGA ACTCAAGGTC AAGATTCGCT ATGTGCCGGT 180

TACAGCACAA ACCCGCGGCC CCCTTCTAGA CAATGAACAG GTCGATATGG ATATCGCGAC 240

CTTTACCATC ACGGACGAAC GCAAAAAACT CTACAACTTT ACCAGTCCCT ACTACACAGA 300

CGCTTCTGGA TTTTTGGTCA ATAAATCTGC CAAAATCAAA AAGATTGAGG ACCTAAACGG 360

CAAAACCATC GGAGTCGCCC AAGGTTCTAT CACCCAACGC CTGATTACTG AACTGGGTAA 420

AAAGAAAGGT CTGAAGTTTA AATTCGTCGA ACTTGGTTCC TACCCAGAAT TGATTACTTC 480

CCTGCACGCT CATCGTATCG ATACCTTTTC CGTTGACCGC TCTATTCTAT CTGGCTACAC 540

TAGTAAACGG ACAGCACTAC TAGATGATAG TTTCAAGCCA TCTGACTACG GTATTGTTAC 600

CAAGAAATCA AATACAGAGC TCAACGACTA TCTTGATAAC TTGGTTACTA AATGGAGCAA 660

GGATGGTAGT TTGCAGAAAC TTTATGACCG TTACAAGCTC AAACCATCTA GCCATACTGC 720

AGAT 724

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Gln Ala Asp Thr Ser Ile Ala Asp Ile Gln Lys Arg Gly Glu Leu Val
1 5 10 15

Val Gly Val Lys Gln Asp Val Pro Asn Phe Gly Tyr Xaa Asp Pro Lys
20 25 30

Thr Gly Thr Tyr Ser Gly Ile Glu Thr Asp Leu Ala Lys Met Val Ala
35 40 45

Asp Glu Leu Lys Val Lys Ile Arg Tyr Val Pro Val Thr Ala Gln Thr
50 55 60

Arg Gly Pro Leu Leu Asp Asn Glu Gln Val Asp Met Asp Ile Ala Thr
65 70 75 80

Phe Thr Ile Thr Asp Glu Arg Lys Lys Leu Tyr Asn Phe Thr Ser Pro
85 90 95

Tyr Tyr Thr Asp Ala Ser Gly Phe Leu Val Asn Lys Ser Ala Lys Ile
100 105 110

Lys Lys Ile Glu Asp Leu Asn Gly Lys Thr Ile Gly Val Ala Gln Gly
115 120 125

Ser Ile Thr Gln Arg Leu Ile Thr Glu Leu Gly Lys Lys Lys Gly Leu
130 135 140

Lys Phe Lys Phe Val Glu Leu Gly Ser Tyr Pro Glu Leu Ile Thr Ser
145 150 155 160

Leu His Ala His Arg Ile Asp Thr Phe Ser Val Asp Arg Ser Ile Leu
165 170 175

Ser Gly Tyr Thr Ser Lys Arg Thr Ala Leu Leu Asp Asp Ser Phe Lys
180 185 190

Pro Ser Asp Tyr Gly Ile Val Thr Lys Lys Ser Asn Thr Glu Leu Asn
195 200 205

Asp Tyr Leu Asp Asn Leu Val Thr Lys Trp Ser Lys Asp Gly Ser Leu
210 215 220

Gln Lys Leu Tyr Asp Arg Tyr Lys Leu Lys Pro Ser Ser His Thr Ala
225 230 235 240

Asp

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



GTCTGTATCA TTTGAAAACA AAGAAACAAA CCGTGGTGTC TTgACTTTCA CTATCTCTCA 60

AGACCAAATC AAACCAGAAT TGGACCGTGT CTTCAAGtCA GTGAAGAAAT CTCTTAATGT 120

TCCAGGTTTC CGTAAAGGTC ACCTTCCACG CCCTATCTTC GACCAAAAAT TTGGTGAAGA 180

AGCTCTTTAT CAAGATGCAA TGAACGCACT TTTGCCAAAC GCTTATGAAG CAGCTGTAAA 240

AGAAGCTGGT CTTGAAGTGG TTGCCCAACC AAAAATTGAC GTAACTTCAA TGGAAAAAGG 300

TCAAGACTGG GTTATCACTG CTGAAGTCGT TACAAAACCT GAAGTAAAAT TGGGTGACTA 360

CAAAAACCTT GAAGTATCAG TTGATGTAGA AAAAGAAGTA ACTGACGCTG ATGTCGAAGA 420

GCGTATCGAA CGCGAACGCA ACAACCTGGC TGAATTGGTT ATCAAGGAAG CTGCTGCTGA 480

AAACGGCGAC ACTGTTGTGA TCGACTTCGT TGGTTCTATC GACGGTGTTG AATTTGACGG 540

TGGAAAAGGT GAAAACTTCT CACTTGGACT TGGTTCAGGT CAATTCATCC CTGGTTTCGA 600

AGACCAATTG GTAGGTCACT CAGCTGGCGA AACCGTTGAT GTTATCGTAA CATTCCCAGA 660

AGACTACCAA GCAGAAGACC TTGCAGGTAA AGAAGCTAAA TTCGTGACAA CTATCCACGA 720

AGTAAAAGCT AAAGAAGTTC CGGCTCTTGA CGATGAACTT GCAAAAGACA TTGATGAAGA 780

AGTTGAAACA CTTGCTGACT TGAAAGAAAA ATACAGCAAA GAATTGGCTG CTGCTAAAGA 840

AGAAGCTTAC AAAGATGCAG TTGAAGGTGC AGCAATTGAT ACAGCTGTAG AAAATGCTGA 900

AATCGTAGAA CTTCCAGAAG AAATGATCCA TGAAGAAGTT CACCGTTCAG TAAATGAATT 960

CCTTGGGAAT TTGCAACGTC AAGGGATCAA CCCTGACATG TACTTCCAAA TCACTGGAAC 1020

TACTCAAGAA GACCTTCACA ACCAATACCA AGCAGAAGCT GAGTCACGTA CTAAGACTAA 1080

CCTTGTTATC GAAGCAGTTG CCAAAGCTGA AGGATTTGAT GCTTCAGAAG AAGAAATCCA 1140

AAAAGAAGTT GAGCAATTGG CAGCAGACTA CAACATGGAA GTTGCACAAG TTCAAAACTT 1200

GCTTTCAGCT GACATGTTGA AACATGATAT CACTATCAAA AAAGCTGTTG AATTGATCAC 1260

AAGCACAGCA ACAGTAAAA 1279

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ser Val Ser Phe Glu Asn Lys Glu Thr Asn Arg Gly Val Leu Thr Phe 
1 5 10 15 

Thr Ile Ser Gln Asp Gln Ile Lys Pro Glu Leu Asp Arg Val Phe Lys
20 25 30

Ser Val Lys Lys Ser Leu Asn Val Pro Gly Phe Arg Lys Gly His Leu
35 40 45

Pro Arg Pro Ile Phe Asp Gln Lys Phe Gly Glu Glu Ala Leu Tyr Gln
50 55 60

Asp Ala Met Asn Ala Leu Leu Pro Asn Ala Tyr Glu Ala Ala Val Lys
65 70 75 80

Glu Ala Gly Leu Glu Val Val Ala Gln Pro Lys Ile Asp Val Thr Ser
85 90 95

Met Glu Lys Gly Gln Asp Trp Val Ile Thr Ala Glu Val Val Thr Lys
100 105 110

Pro Glu Val Lys Leu Gly Asp Tyr Lys Asn Leu Glu Val Ser Val Asp
115 120 125

Val Glu Lys Glu Val Thr Asp Ala Asp Val Glu Glu Arg Ile Glu Arg
130 135 140

Glu Arg Asn Asn Leu Ala Glu Leu Val Ile Lys Glu Ala Ala Ala Glu
145 150 155 160

Asn Gly Asp Thr Val Val Ile Asp Phe Val Gly Ser Ile Asp Gly Val
165 170 175

Glu Phe Asp Gly Gly Lys Gly Glu Asn Phe Ser Leu Gly Leu Gly Ser
180 185 190

Gly Gln Phe Ile Pro Gly Phe Glu Asp Gln Leu Val Gly His Ser Ala
195 200 205

Gly Glu Thr Val Asp Val Ile Val Thr Phe Pro Glu Asp Tyr Gln Ala
210 215 220

Glu Asp Leu Ala Gly Lys Glu Ala Lys Phe Val Thr Thr Ile His Glu
225 230 235 240

Val Lys Ala Lys Glu Val Pro Ala Leu Asp Asp Glu Leu Ala Lys Asp
245 250 255

Ile Asp Glu Glu Val Glu Thr Leu Ala Asp Leu Lys Glu Lys Tyr Ser
260 265 270

Lys Glu Leu Ala Ala Ala Lys Glu Glu Ala Tyr Lys Asp Ala Val Glu
275 280 285

Gly Ala Ala Ile Asp Thr Ala Val Glu Asn Ala Glu Ile Val Glu Leu
290 295 300

Pro Glu Glu Met Ile His Glu Glu Val His Arg Ser Val Asn Glu Phe
305 310 315 320

Leu Gly Asn Leu Gln Arg Gln Gly Ile Asn Pro Asp Met Tyr Phe Gln
325 330 335

Ile Thr Gly Thr Thr Gln Glu Asp Leu His Asn Gln Tyr Gln Ala Glu
340 345 350

Ala Glu Ser Arg Thr Lys Thr Asn Leu Val Ile Glu Ala Val Ala Lys
355 360 365

Ala Glu Gly Phe Asp Ala Ser Glu Glu Glu Ile Gln Lys Glu Val Glu
370 375 380

Gln Leu Ala Ala Asp Tyr Asn Met Glu Val Ala Gln Val Gln Asn Leu
385 390 395 400

Leu Ser Ala Asp Met Leu Lys His Asp Ile Thr Ile Lys Lys Ala Val
405 410 415

Glu Leu Ile Thr Ser Thr Ala Thr Val Lys
420 425

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

TGGTCAAAAG GAAAGTCAGA CAGGAAAGGG GATGAAAATT GTGACCAGTT TTTATCCTAT 60

CTACGCTATG GTTAAGGAAG TATCTGGTGA CTTGAATGAT GTTCGGATGA TTCAGTCAAG 120

TAGTGGTATT CACTCCTTTG AACCTTCGGC AAATGATATC GCAGCCATCT ATGATGCAGA 180

TGTCTTTGTT TACCATTCTC ATACACTCGA ATCTTGGGCA GGAAGTCTGG ATCCAAATCT 240

AAAAAAATCC AAAGTGAAGG TCTTAGAGGC TTCTGAGGGA ATGACCTTGG AACGTGTCCC 300

TGGACTAGAG GATGTGGAAG CAGGGGATGG AGTTGATGAA AAAACGCTCT ATGACCCTCA 360

CACATGGCTA GATCCTGAAA AAGCTGGAGA AGAAGCCCAA ATTATCGCTG ATAAACTTTC 420

AGAGGTGGAT AGTGAGCATA AAGAGACTTA TCAAAAAAAT GCGCAACCTT TATCAAAAAA 480

GCTCAGGAAT 490

(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Gly Gln Lys Glu Ser Gln Thr Gly Lys Gly Met Lys Ile Val Thr Ser
1 5 10 15

Phe Tyr Pro Ile Tyr Ala Met Val Lys Glu Val Ser Gly Asp Leu Asn
20 25 30

Asp Val Arg Met Ile Gln Ser Ser Ser Gly Ile His Ser Phe Glu Pro
35 40 45

Ser Ala Asn Asp Ile Ala Ala Ile Tyr Asp Ala Asp Val Phe Val Tyr
50 55 60

His Ser His Thr Leu Glu Ser Trp Ala Gly Ser Leu Asp Pro Asn Leu
65 70 75 80

Lys Lys Ser Lys Val Lys Val Leu Glu Ala Ser Glu Gly Met Thr Leu
85 90 95

Glu Arg Val Pro Gly Leu Glu Asp Val Glu Ala Gly Asp Gly Val Asp
100 105 110

Glu Lys Thr Leu Tyr Asp Pro His Thr Trp Leu Asp Pro Glu Lys Ala
115 120 125

Gly Glu Glu Ala Gln Ile Ile Ala Asp Lys Leu Ser Glu Val Asp Ser
130 135 140

Glu His Lys Glu Thr Tyr Gln Lys Asn Ala Gln Pro Leu Ser Lys Lys
145 150 155 160

Leu Arg Asn

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GAAGGATAGA TATATTTTAG CATTTGAGAC ATCCTGTGAT GAGACCAGTG TCGCCGTCTT 60 

GAAAAACGAC GATGAGCTCT TGTCCAATGT CATTGCTAGT CAAATTGAGA GTCACAAACG 120

TTTTGGTGGC GTAGTGCCCG AAGTAGCCAG TCGTCACCAT GTCGAGGTCA TTACAGCCTG 180

TATCGAGGAG GCATTGGCAG AAGCAGGGAT TACCGAAGAG GACGTGACAG CTGTTGCGGT 240

TACCTACGGA CCAGGCTTGG TCGGAGCCTT [illegible] [illegible] [illegible] 300

TGCTTGGGCT CACGGACTTC CACTGATTCC [illegible] [illegible] [illegible] 360

AGCTCAGAGT GTGGAGCCTT TGGAGTTTCC [illegible] [illegible] [illegible] 420

CACAGAGTTG GTTTATGTTT CGGAGGCAGG [illegible] [illegible] [illegible] 480

TGATGCGGTT GGTGAGGCTT ATGATAAGGT CGGCCGTGTC ATGGGCTTGA [illegible] 540

AGGTCGTGAG ATTGACGAGC TGGCTCATCA GGGGCAGGAT ATTTATGATT [illegible] 600

CATGATTAAG GAAGATAATC TGGAGTTCTC CTTCTCAGGT TTGAAATCTG CCTTTATCAA 660

TCTTCATCAC AATGCCGAGC AAAAGGGAGA AAGCCTGTCT ACAGAAGATT TGTGTGCTTC 720

CTTCCAAGCA GCAGTTATGG ACATTCTCAT GGCAAAAACC AAGAAGGCTT TGGAGAAATA 780

TCCTGTTAAA ATCCTAGTTG TGGCAGGTGG TGTGGCAGCC AATAAAGGTC TCAGAGAACG 840

CCTAGCAGCC GAAATCACAG ATGTCAAGGT TATCATCCCC CCTCTGCGAC TCTGCGGAGA 900

CAATGCAGGT ATGATTGCCT ATGCCAGCGT CAGCNAGTGG AACAAAGAAA ACTTCGCAGG 960

CTGGGACCTC AATGCCAAAC CAAGTCTTGC CTTTGATACC ATGGAA 1006



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Lys Asp Arg Tyr Ile Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr Ser
1 5 10 15

Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val Ile Ala
20 25 30

Ser Gln Ile Glu Ser His Lys Arg Phe Gly Gly Val Val Pro Glu Val
35 40 45

Ala Ser Arg His His Val Glu Val Ile Thr Ala Cys Ile Glu Glu Ala
50 55 60

Leu Ala Glu Ala Gly Ile Thr Glu Glu Asp Val Thr Ala Val Ala Val
65 70 75 80

Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser Ala
85 90 95

Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu Ile Pro Val Asn
100 105 110

His Met Ala Gly His Leu Met Ala Ala Gln Ser Val Glu Pro Leu Glu
115 120 125

Phe Pro Leu Leu Ala Leu Leu Val Ser Gly Gly His Thr Glu Leu Val
130 135 140

Tyr Val Ser Glu Ala Gly Asp Tyr Lys Ile Val Gly Glu Thr Arg Asp
145 150 155 160

Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly Leu
165 170 175

Thr Tyr Pro Ala Gly Arg Glu Ile Asp Glu Leu Ala His Gln Gly Gln
180 185 190

Asp Ile Tyr Asp Phe Pro Arg Ala Met Ile Lys Glu Asp Asn Leu Glu
195 200 205

Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe Ile Asn Leu His His Asn
210 215 220

Ala Glu Gln Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala Ser
225 230 235 240

Phe Gln Ala Ala Val Met Asp Ile Leu Met Ala Lys Thr Lys Lys Ala
245 250 255

Leu Glu Lys Tyr Pro Val Lys Ile Leu Val Val Ala Gly Gly Val Ala
260 265 270

Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu Ile Thr Asp Val
275 280 285

Lys Val Ile Ile Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly Met
290 295 300

Ile Ala Tyr Ala Ser Val Ser Xaa Trp Asn Lys Glu Asn Phe Ala Gly
305 310 315 320

Trp Asp Leu Asn Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu
325 330 335



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GGTAGTTAAA GTTGGTATTA ACGGTTTCGG ACGTATCGGT CGTCTTGCTT TCCGTCGTAT 60 

CCAAAACGTA GAAGGTGTTG AAGTTACACG CATCAACGAC CTTACAGATC CAGTTATGCT 120

TGCACACTTG TTGAAATACG ACACAACTCA AGGTCGTTTC GACGGTACTG TTGAAGTTAA 180

AGAAGGTGGA TTTGAAGTTA ACGGTAAATT CATCAAAGTT TCTGCTGAAC GTGATCCAGA 240

ACAAATCGAC TGGGCTACTG ACGGTGTAGA AATCGTTCTT GAAGCTACTG GTTTCTTTGC 300

TAAGAAAGAA GCAGCTGAAA AACACCTTAA AGGTGGAGCT AAAAAAGTTG TTATCACTGC 360

TCCTGGTGGA AACGACGTTA AAACAGTTGT ATTCAACACT AACCACGACG TTCTTGACGG 420

TACTGAAACA GTTATCTCAG GTGCTTCATG TACTACAAAC TGCTTGGCTC CAATGGCTAA 480

AGCTCTTCAA GACAACTTTG GTGTTGTTGA AGGATTGATG ACTACTATCC ACGCTTACAC 540

TGGTGACCAA ATGATCCTTG ACGGACCACA CCGTGGTGGT GACCTTCGCC GTGCTCGCGC 600

TGGTGCTGCA AACATCGTTC CTAACTCAAC TGGTGCTGCA AAAGCTATCG GTCTTGTAAT 660

CCCAGAATTG AATGGTAAAC TTGACGGATC TGCACAACGC GTTCCAACTC CAACTGGATC 720

AGTTACTGAA TTGGTAGCAG TTCTTGAAAA GAACGTTACT GTTGATGAAG TGAACGCAGC 780

TATGAAAGCA GCTTCAAACG AATCATACGG TTACACAGAA GATCCAATCG TATCTTCAGA 840

TATCGTAGGT ATGTCTTACG GTTCATTGTT TGACGCAACT CAAACTAAAG TTCTTGACGT 900

TGACGGTAAA CAATTGGTTA AAGTTGTATC ATGGTACGAC AACGAAATGT CATACACTGC 960

ACAACTTGTT CGTACTCTTG GAATACTTCG CAAAAATTGC 1000

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Val Val Lys Val Gly Ile Asn Gly Phe Gly Arg Ile Gly Arg Leu Ala
1 5 10 15

Phe Arg Arg Ile Gln Asn Val Glu Gly Val Glu Val Thr Arg Ile Asn
20 25 30

Asp Leu Thr Asp Pro Val Met Leu Ala His Leu Leu Lys Tyr Asp Thr
35 40 45

Thr Gln Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly Phe
50 55 60

Glu Val Asn Gly Lys Phe Ile Lys Val Ser Ala Glu Arg Asp Pro Glu
65 70 75 80

Gln Ile Asp Trp Ala Thr Asp Gly Val Glu Ile Val Leu Glu Ala Thr
85 90 95

Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu Lys Gly Gly
100 105 110

Ala Lys Lys Val Val Ile Thr Ala Pro Gly Gly Asn Asp Val Lys Thr
115 120 125

Val Val Phe Asn Thr Asn His Asp Val Leu Asp Gly Thr Glu Thr Val
130 135 140

Ile Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met Ala Lys
145 150 155 160

Ala Leu Gln Asp Asn Phe Gly Val Val Glu Gly Leu Met Thr Thr Ile
165 170 175

His Ala Tyr Thr Gly Asp Gln Met Ile Leu Asp Gly Pro His Arg Gly
180 185 190

Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn Ile Val Pro Asn
195 200 205

Ser Thr Gly Ala Ala Lys Ala Ile Gly Leu Val Ile Pro Glu Leu Asn
210 215 220

Gly Lys Leu Asp Gly Ser Ala Gln Arg Val Pro Thr Pro Thr Gly Ser
225 230 235 240

Val Thr Glu Leu Val Ala Val Leu Glu Lys Asn Val Thr Val Asp Glu
245 250 255

Val Asn Ala Ala Met Lys Ala Ala Ser Asn Glu Ser Tyr Gly Tyr Thr
260 265 270

Glu Asp Pro Ile Val Ser Ser Asp Ile Val Gly Met Ser Tyr Gly Ser
275 280 285

Leu Phe Asp Ala Thr Gln Thr Lys Val Leu Asp Val Asp Gly Lys Gln
290 295 300

Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr Thr Ala
305 310 315 320

Gln Leu Val Arg Thr Leu Gly Ile Leu Arg Lys Asn Cys
325 330

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

TTCTTACGAG TTGGGACTGT ATCAAGCTAG AACGGTTAAG GAAAATAATC GTGTTTCCTA 60 

TATAGATGGA AAACAAGCGA CGCAAAAAAC GGAGAATTTG ACTCCTGATG AGGTTAGCAA 120

GCGTGAAGGA ATCAATGCTG AGCAAATCGT CATCAAGATA ACAGACCAAG GCTATGTCAC 180

TTCACATGGC GACCACTATC ATTATTACAA TGGTAAGGTT CCTTATGACG CTATCATCAG 240

TGAAGAATTA CTCATGAAAG ATCCAAACTA TAAGCTAAAA GATGAGGATA TTGTTAATGA 300

GGTCAAGGGT GGATATGTTA TCAAGGTAGA TGGAAAATAC TATGTTTACC TTAAGGATGC 360

TGCCCACGCG GATAACGTCC GTACAAAAGA GGAAATCAAT CGACAAAAAC AAGAGCATAG 420

TCAACATCGT GAAGGTGGAA CTCCAAGAAA CGATGGTGCT GTTGCCTTGG CACGTTCGCA 480

AGGACGCTAT ACTACAGATG ATGGTTATAT CTTTAATGCT TCTGATATCA TAGAGGATAC 540

TGGTGATGCT TATATCGTTC CTCATGGAGA TCATTACCAT TACATTCCTA AGAATGAGTT 600

ATCAGCTAGC GAGTTGGCTG CTGCAGAAGC CTTCCTATCT GGTCGAGGAA ATCTGTCAAA 660

TTCAAGAACC TATCGCCGAC AAAATAGCGA TAALALT I LA AbAALAAAL I bAjvjlACC 1 1L 720

TGTAAGCAAT CCAGGAACTA CAAATACTAA CACAACjLAAC AAL ALjL AAL A C 1 AAC AC» I LA 780

AGCAAGTCAA AGTAATGACA TTGATAGTCT CTTGAAACAG CTCTACAAAC TGCCTTTGAG 840

TCAACGACAT GTAGAATCTG ATGGCCTTGT CTTTGATCCA GCACAAATCA CAAGTCGAAC 900

AGCTAGAGGT GTTGCAGTGC CACACGGAGA TCATTACCAC TTCATCCCTT ACTCTCAAAT 960

GTCTGAATTG GAAGAACGAA TCGCTCGTAT TATTCCCCTT CGTTATCGTT CAAACCATTG 1020

GGTACCAGAT TCAAGGCCAG AACAACCAAG TCCACAACCG ACTCCGGAAC CTAGTCCAGG 1080

CCCGCAACCT GCACCAAATC TTAAAATAGA CTCAAATTCT TCTTTGGTTA GTCAGCTGGT 1140

ACGAAAAGTT GGGGAAGGAT ATGTATTCGA AGAAAAGGGC ATCTCTCGTT ATGTCTTTGC 1200

GAAAGATTTA CCATCTGAAA CTGTTAAAAA TCTTGAAAGC AAGTTATCAA AACAAGAGAG 1260

TGTTTCACAC ACTTTAACTG CTAAAAAAGA AAATGTTGCT CCTCGTGACC AAGAATTTTA 1320

TGATAAAGCA TATAATCTGT TAACTGAGGC TCATAAAGCC TTGTTTGNAA ATAAGGGTCG 1380

TAATTCTGAT TTCCAAGCCT TAGACAAATT ATTAGAACGC TTGAATGATG AATCGACTAA 1440

TAAAGAAAAA TTGGTAGATG ATTTATTGGC ATTCCTAGCA CCAATTACCC ATCCAGAGCG 1500

ACTTGGCAAA CCAAATTCTC AAATTGAGTA TACTGAAGAC GAAGTTCGTA TTGCTCAATT 1560

AGCTGATAAG TATACAACGT CAGATGGTTA CATTTTTGAT GAACATGATA TAATCAGTGA 1620

TGAAGGAGAT GCATATGTAA CGCCTCATAT GGGCCATAGT CACTGGATTG GAAAAGATAG 1680

CCTTTCTGAT AAGGAAAAAG TTGCAGCTCA AGCCTATACT AAAGAAAAAG GTATCCTACC 1740

TCCATCTCCA GACGCAGATG TTAAAGCAAA TCCAACTGGA GATAGTGCAG CAGCTATTTA 1800

CAATCGTGTG AAAGGGGAAA AACGAATTCC ACTCGTTCGA CTTCCATATA TGGTTGAGCA 1860

TACAGTTGAG GTTAAAAACG GTAATTTGAT TATTCCTCAT AAGGATCATT ACCATAATAT 1920

TAAATTTGCT TGGTTTGATG ATCACACATA CAAAGCTCCA AATGGCTATA CCTTGGAAGA 1980

TTTGTTTGCG ACGATTAAGT ACTACGTAGA ACACLCTGAL bAAC (j i C CAC A 1 J. C i. AA i bA 2040

TGGATGGGGC AATGCCAGTG AGCATGTGTT AGGCAAGAAA GACCACAGTG AAGATCCAAA 2100

TAAGAACTTC AAAGCGGATG AAGAGCCAGT AGAGGAAACA CCTGCTGAGC CAGAAGTCCC 2160

TCAAGTAGAG ACTGAAAAAG TAGAAGCCCA ACTCAAAGAA GCAGAAGTTT TGCTTGCGAA 2220

AGTAACGGAT TCTAGTCTGA AAGCCAATGC AACAGAAACT CTAGCTGGTT TACGAAATAA 2280

TTTGACTCTT CAAATTATGG ATAACAATAG TATCATGGCA GAAGCAGAAA AATTACTTGC 2340

GTTGTTAAAA GGAAGTAATC CTTCATCTGT AAGTAAGGAA AAAATAAAC 2389


(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 796 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Ser Tyr Glu Leu Gly Leu Tyr Gln Ala Arg Thr Val Lys Glu Asn Asn
1 5 10 15

Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln Lys Thr Glu Asn
20 25 30

Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala Glu Gln
35 40 45

Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly Asp
50 55 60

His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile Ile Ser
65 70 75 80

Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu Lys Asp Glu Asp
85 90 95

Ile Val Asn Glu Val Lys Gly Gly Tyr Val Ile Lys Val Asp Gly Lys
100 105 110

Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg Thr
115 120 125

Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser Gln His Arg Glu
130 135 140

Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu Ala Arg Ser Gln
145 150 155 160

Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175

Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr
180 185 190

His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205

Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn Ser Arg Thr Tyr
210 215 220

Arg Arg Gln Asn Ser Asp Asn Thr Ser Arg Thr Asn Trp Val Pro Ser
225 230 235 240

Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser Asn
245 250 255

Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu Lys
260 265 270

Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp Gly
275 280 285

Leu Val Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly Val
290 295 300

Ala Val Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Gln Met
305 310 315 320

Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro Leu Arg Tyr Arg
325 330 335

Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro Ser Pro Gln
340 345 350

Pro Thr Pro Glu Pro Ser Pro Gly Pro Gln Pro Ala Pro Asn Leu Lys
355 360 365

Ile Asp Ser Asn Ser Ser Leu Val Ser Gln Leu Val Arg Lys Val Gly
370 375 380

Glu Gly Tyr Val Phe Glu Glu Lys Gly Ile Ser Arg Tyr Val Phe Ala
385 390 395 400

Lys Asp Leu Pro Ser Glu Thr Val Lys Asn Leu Glu Ser Lys Leu Ser
405 410 415

Lys Gln Glu Ser Val Ser His Thr Leu Thr Ala Lys Lys Glu Asn Val
420 425 430

Ala Pro Arg Asp Gln Glu Phe Tyr Asp Lys Ala Tyr Asn Leu Leu Thr
435 440 445

Glu Ala His Lys Ala Leu Phe Xaa Asn Lys Gly Arg Asn Ser Asp Phe
450 455 460

Gln Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp Glu Ser Thr Asn
465 470 475 480

Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu Ala Pro Ile Thr
485 490 495

His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile Glu Tyr Thr Glu
500 505 510

Asp Glu Val Arg Ile Ala Gln Leu Ala Asp Lys Tyr Thr Thr Ser Asp
515 520 525

Gly Tyr Ile Phe Asp Glu His Asp Ile Ile Ser Asp Glu Gly Asp Ala
530 535 540

Tyr Val Thr Pro His Met Gly His Ser His Trp Ile Gly Lys Asp Ser
545 550 555 560

Leu Ser Asp Lys Glu Lys Val Ala Ala Gln Ala Tyr Thr Lys Glu Lys
565 570 575

Gly Ile Leu Pro Pro Ser Pro Asp Ala Asp Val Lys Ala Asn Pro Thr
580 585 590

Gly Asp Ser Ala Ala Ala Ile Tyr Asn Arg Val Lys Gly Glu Lys Arg
595 600 605

Ile Pro Leu Val Arg Leu Pro Tyr Met Val Glu His Thr Val Glu Val
610 615 620

Lys Asn Gly Asn Leu Ile Ile Pro His Lys Asp His Tyr His Asn Ile
625 630 635 640

Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala Pro Asn Gly Tyr
645 650 655

Thr Leu Glu Asp Leu Phe Ala Thr Ile Lys Tyr Tyr Val Glu His Pro
660 665 670

Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn Ala Ser Glu His
675 680 685

Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn Lys Asn Phe Lys
690 695 700

Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu Pro Glu Val Pro
705 710 715 720

Gln Val Glu Thr Glu Lys Val Glu Ala Gln Leu Lys Glu Ala Glu Val
725 730 735

Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala Asn Ala Thr Glu
740 745 750

Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gln Ile Met Asp Asn
755 760 765

Asn Ser Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys Gly
770 775 780

Ser Asn Pro Ser Ser Val Ser Lys Glu Lys Ile Asn
785 790 795

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TACTGAGATG CATCATAATC TAGGAGCTGA AAAGCGTTCA GCAGTGGCTA CTACTATCGA 60 

TAGTTTTAAG GAGCGAAGTC AAAAAGTCAG AGCACTATCT GATCCAAATG TGCGTTTTGT 120

TCCCTTCTTT GGCTCTAGTG AATGGCTTCG TTTTGACGGT GCTCATTCTG CGGTATTAGC 180

TGAGAAATAC AATCGTTCCT ACCGTCCTTA TCTTTTAGGA CAGGGGGGAG CTGCATCGCT 240

TAACCAATAT TTTGGAATGC AACAGATGTT ACCACAGCTG GAGAATAAAC AAGTTGTGTA 300

TGTTATCTCA CCTCAGTGGT TCAGTAAAAA TGGCTATGAT CCAGCAGCCT TCCAGCAGTA 360

TTTTAATGGA GACCAGTTGA CTAGTTTTCT GAAACATCAA TCTGGGGATC AGGCTAGTCA 420

ATATGCAGCG ACTCGCTTAC TGCAACAGTT CCCAAACGTA GCTATGAAGG ACCTGGTTCA 480

GAAGTTGGCA AGTAAAGAAG AATTGTCGAC AGCAGACAAT GAAATGATTG AATTATTGGC 540

TCGTTTTAAT GAACGCCAAG CTTCCTTTTT TGGTCAGTTT TCGGTTAGAG GCTATGTTAA 600

CTACGATAAG CATGTAGCTA AGTATTTAAA AATCTTGCCA GACCAGTTTT CTTATCAGGC 660

AATAGAAGAT GTTGTCAAAG CAGATGCTGA AAAAAATACT TCCAATAATG AGATGGGAAT 720

GGAAAATTAT TTCTATAATG AGCAGATCAA GAAGGATTTG AAGAAATTAA AGGATTCTCA 780

GAAAAGCTTT ACCTATCTCA AGTCGCCAGA GTATAATGNN TTGCAGTTGG TTTTAACACA 840

GTTTTCTAAA TCTAAGGTAA ACCCGATTTT TATCATTCCA CCTGTTAATA AAAAATGGAT 900

GNACTATGCT GGTCTACGAG AGGATATGTA CCAACAAACG GTGCAGAAGA TTCGCTACCA 960

GTTAGAAAGT CAAGGTTTTA CCAATATAGC AGATTTTTCT AAGGACGGCG GGGAGCCTTT 1020

CTTTATGAAG GACACCATTC ACCTTGGTTG GTTGGGTTGG TTGGCTTTTG ACAAGGCAGT 1080

TGATCCTTTC CTATCCAATC CCACACCAGC TCCGACTTAC CATCTGAATG AGCGCTTTTT 1140

CAGCAAAGAT TGGGCGACTT ATGATGGAGA TGTCAAAGAA 1180
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Thr Glu Met His His Asn Leu Gly Ala Glu Lys Arg Ser Ala Val Ala
1 5 10 15

Thr Thr Ile Asp Ser Phe Lys Glu Arg Ser Gln Lys Val Arg Ala Leu
20 25 30

Ser Asp Pro Asn Val Arg Phe Val Pro Phe Phe Gly Ser Ser Glu Trp
35 40 45

Leu Arg Phe Asp Gly Ala His Ser Ala Val Leu Ala Glu Lys Tyr Asn
50 55 60

Arg Ser Tyr Arg Pro Tyr Leu Leu Gly Gln Gly Gly Ala Ala Ser Leu
65 70 75 80

Asn Gln Tyr Phe Gly Met Gln Gln Met Leu Pro Gln Leu Glu Asn Lys
85 90 95

Gln Val Val Tyr Val Ile Ser Pro Gln Trp Phe Ser Lys Asn Gly Tyr
100 105 110

Asp Pro Ala Ala Phe Gln Gln Tyr Phe Asn Gly Asp Gln Leu Thr Ser
115 120 125

Phe Leu Lys His Gln Ser Gly Asp Gln Ala Ser Gln Tyr Ala Ala Thr
130 135 140

Arg Leu Leu Gln Gln Phe Pro Asn Val Ala Met Lys Asp Leu Val Gln
145 150 155 160

Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr Ala Asp Asn Glu Met Ile
165 170 175

Glu Leu Leu Ala Arg Phe Asn Glu Arg Gln Ala Ser Phe Phe Gly Gln
180 185 190

Phe Ser Val Arg Gly Tyr Val Asn Tyr Asp Lys His Val Ala Lys Tyr
195 200 205

Leu Lys Ile Leu Pro Asp Gln Phe Ser Tyr Gln Ala Ile Glu Asp Val
210 215 220

Val Lys Ala Asp Ala Glu Lys Asn Thr Ser Asn Asn Glu Met Gly Met
225 230 235 240

Glu Asn Tyr Phe Tyr Asn Glu Gln Ile Lys Lys Asp Leu Lys Lys Leu
245 250 255

Lys Asp Ser Gln Lys Ser Phe Thr Tyr Leu Lys Ser Pro Glu Tyr Asn
260 265 270

Xaa Leu Gln Leu Val Leu Thr Gln Phe Ser Lys Ser Lys Val Asn Pro
275 280 285

Ile Phe Ile Ile Pro Pro Val Asn Lys Lys Trp Met Xaa Tyr Ala Gly
290 295 300

Leu Arg Glu Asp Met Tyr Gln Gln Thr Val Gln Lys Ile Arg Tyr Gln
305 310 315 320

Leu Glu Ser Gln Gly Phe Thr Asn Ile Ala Asp Phe Ser Lys Asp Gly
325 330 335

Gly Glu Pro Phe Phe Met Lys Asp Thr Ile His Leu Gly Trp Leu Gly
340 345 350

Trp Leu Ala Phe Asp Lys Ala Val Asp Pro Phe Leu Ser Asn Pro Thr
355 360 365

Pro Ala Pro Thr Tyr His Leu Asn Glu Arg Phe Phe Ser Lys Asp Trp
370 375 380

Ala Thr Tyr Asp Gly Asp Val Lys Glu
385 390

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1423 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

GGTTTTGAGA AAGTATTTGC AGGGGGCCCT GATTGAGTCG ATTGAGCAAG TGGAAAATGA 60

CCGTATTGTG GAAATTACAG TTTCCAATAA AAACGAGATT GGAGACCATA TCCAGGCTAC 120

CTTGATTATC GAAATTATGG GGAAACACAG TAATATTCTA CTGGTCGATA AAAGCAGTCA 180

TAAAATCCTC GAAGTTATCA AACACGTCGG CTTTTCACAA AATAGCTACC GCACCTTACT 240

TCCAGGATCG ACCTATATCG CTCCGCCAAG TACAAAATCT CTCAATCCTT TTACTATCAA 300

GGATGAAAAG CTCTTTGAAA TCCTGCAAAC CCAAGAACTA ACAGCAAAAA ATCTTCAAAG 360

CCTCTTTCAA GGTCTGGGAC GCGATACGGC AAATGAATTG GAAAGGATAC TGGTTAGTGA 420

AAAACTTTCC GCTTTCCGAA ATTTTTTCAA TCAAGAAACC AAGCCATGCT TGACTGAGAC 480

TTCCTTCAGT CCAGTTCCTT TTGCAAATCA GGTGGGAGAG CCTTTTGCAA ATCTTTCTGA 540

TTTGTTGGAC ACCTACTATA AGGATAAGGC TGAGCGCGAC CGCGTCAAAC AGCAGGCCAG 600

TGAACTGATT CGTCGTGTTG AAAATGAACT TCAGAAAAAC CGACACAAAC TCAAAAAACA 660

GGAAAAAGAG TTACTGGCGA CAGACAACGC TGAAGAATTT CGTCAAAAAG GAGAATTGCT 720

GACAACCTTC CTCCACCAAG TGCCTAACGA CCAAGACCAG GTTATCCTAG ACAACTACTA 780

TACCAACCAA CCTATCATGA TTGCGCTTGA TAAGGCTCTG ACTCCCAACC AGAATGCCCA 840

ACGCTATTTT AAACGGTATC AGAAACTCAA AGAAGCTGTC AAATACTTGA CTGATTTGAT 900

TGAAGAAACC AAAGCCACTA TTCTCTATCT GGAAAGTGTA GAAACCGTCC TCAACCAAGC 960

TGGACTGGAA GAAATCGCTG AAATCCGTGA AGAATTGATT CAAACAGGTT TTATCCGCAG 1020

AAGACAACGG GAGAAAATCC AGAAACGCAA AAAACTAGAA CAATATCTAG CAAGCGATGG 1080

CAAAACCATC ATCTATGTCG GACGAAACAA TCTTCAAAAT GAGGAATTGA CCTTTAAAAT 1140

GGCCCGCAAG GAGGAACTTT GGTTCCATGC TAAGGACATT CCTGGAAGCC ATGTTGTCAT 1200

CTCAGGAAAT CTTGACCCAT CTGATGCAGT CAAGACAGAC GCAGCAGAGT TAGCTGCCTA 1260

CTTCTCTCAA GGGCGCCTGT CGAATCTGGT GCAGGTAGAT ATGATTGAAG TCAAAAAACT 1320

CAATAAACCA ACTGGTGGAA AACCCGGCTT TGTCACTTAC ACAGGACAAA AGACCCTCCG 1380

CGTCACACCA GACTCCAAAA AAATTGCATC CATGAAAAAA TCC 1423



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Val Leu Arg Lys Tyr Leu Gln Gly Ala Leu Ile Glu Ser Ile Glu Gln
1 5 10 15

Val Glu Asn Asp Arg Ile Val Glu Ile Thr Val Ser Asn Lys Asn Glu
20 25 30

Ile Gly Asp His Ile Gln Ala Thr Leu Ile Ile Glu Ile Met Gly Lys
35 40 45

His Ser Asn Ile Leu Leu Val Asp Lys Ser Ser His Lys Ile Leu Glu
50 55 60

Val Ile Lys His Val Gly Phe Ser Gln Asn Ser Tyr Arg Thr Leu Leu
65 70 75 80

Pro Gly Ser Thr Tyr Ile Ala Pro Pro Ser Thr Lys Ser Leu Asn Pro
85 90 95

Phe Thr Ile Lys Asp Glu Lys Leu Phe Glu Ile Leu Gln Thr Gln Glu
100 105 110

Leu Thr Ala Lys Asn Leu Gln Ser Leu Phe Gln Gly Leu Gly Arg Asp
115 120 125

Thr Ala Asn Glu Leu Glu Arg Ile Leu Val Ser Glu Lys Leu Ser Ala
130 135 140

Phe Arg Asn Phe Phe Asn Gln Glu Thr Lys Pro Cys Leu Thr Glu Thr
145 150 155 160

Ser Phe Ser Pro Val Pro Phe Ala Asn Gln Val Gly Glu Pro Phe Ala
165 170 175

Asn Leu Ser Asp Leu Leu Asp Thr Tyr Tyr Lys Asp Lys Ala Glu Arg
180 185 190

Asp Arg Val Lys Gln Gln Ala Ser Glu Leu Ile Arg Arg Val Glu Asn
195 200 205

Glu Leu Gln Lys Asn Arg His Lys Leu Lys Lys Gln Glu Lys Glu Leu
210 215 220

Leu Ala Thr Asp Asn Ala Glu Glu Phe Arg Gln Lys Gly Glu Leu Leu
225 230 235 240

Thr Thr Phe Leu His Gln Val Pro Asn Asp Gln Asp Gln Val Ile Leu
245 250 255

Asp Asn Tyr Tyr Thr Asn Gln Pro Ile Met Ile Ala Leu Asp Lys Ala
260 265 270

Leu Thr Pro Asn Gln Asn Ala Gln Arg Tyr Phe Lys Arg Tyr Gln Lys
275 280 285

Leu Lys Glu Ala Val Lys Tyr Leu Thr Asp Leu Ile Glu Glu Thr Lys
290 295 300

Ala Thr Ile Leu Tyr Leu Glu Ser Val Glu Thr Val Leu Asn Gln Ala
305 310 315 320

Gly Leu Glu Glu Ile Ala Glu Ile Arg Glu Glu Leu Ile Gln Thr Gly
325 330 335

Phe Ile Arg Arg Arg Gln Arg Glu Lys Ile Gln Lys Arg Lys Lys Leu
340 345 350

Glu Gln Tyr Leu Ala Ser Asp Gly Lys Thr Ile Ile Tyr Val Gly Arg
355 360 365

Asn Asn Leu Gln Asn Glu Glu Leu Thr Phe Lys Met Ala Arg Lys Glu
370 375 380

Glu Leu Trp Phe His Ala Lys Asp Ile Pro Gly Ser His Val Val Ile
385 390 395 400

Ser Gly Asn Leu Asp Pro Ser Asp Ala Val Lys Thr Asp Ala Ala Glu
405 410 415

Leu Ala Ala Tyr Phe Ser Gln Gly Arg Leu Ser Asn Leu Val Gln Val
420 425 430

Asp Met Ile Glu Val Lys Lys Leu Asn Lys Pro Thr Gly Gly Lys Pro
435 440 445

Gly Phe Val Thr Tyr Thr Gly Gln Lys Thr Leu Arg Val Thr Pro Asp
450 455 460

Ser Lys Lys Ile Ala Ser Met Lys Lys Ser
465 470

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GACAACATTT ACTATCCATA CAGTAGAGTC AGCACCAGCA GAAGTGAAAG AAATTCTTGA 60
AACAGTAGAA AAAGACAACA ATGGCTATAT TCCCAACCTA ATCGGTCTCT TGGCCAATGC 120
CCCGACTGTT TTAGAAGCCT ACCAAATTGT CTCATCTATC CACCGTCGCA ACAGCCTGAC 180
ACCCGTTGAG CGTGAAGTGG TGCAAATCAC GGCAGCCGTG ACCAATGGTT GTGCCTTCTG 240
TGTCGCAGGT CACACAGCCT TTTCCATCAA ACAAATCCAG ATGAATGATG ACTTGATTCA 300
AGCTCTTCGC AATCGTACTC CAATTGAAAC AGATCCTAAA TTGGATACCC TAGCTAAGTT 360
TACCTTGGCA GTTATCAATA CCAAGGGTCG TGTAGGAGAT GAAGCCTTGT CTGAGTTTTT 420
AGAAGCTGGC TACACTCAAC AAAATGCCTT GGATGTGGTT TTTGGTGTCA GCCTAGCAAT 480
CCTCTGTAAC TATGCCAACA ACTTAGCTAA TACACCAATT AATCCAGAAT TGCAACCTTA 540
TGCC 544

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 181 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Thr Thr Phe Thr Ile His Thr Val Glu Ser Ala Pro Ala Glu Val Lys
1 5 10 15

Glu Ile Leu Glu Thr Val Glu Lys Asp Asn Asn Gly Tyr Ile Pro Asn
20 25 30

Leu Ile Gly Leu Leu Ala Asn Ala Pro Thr Val Leu Glu Ala Tyr Gln
35 40 45

Ile Val Ser Ser Ile His Arg Arg Asn Ser Leu Thr Pro Val Glu Arg
50 55 60

Glu Val Val Gln Ile Thr Ala Ala Val Thr Asn Gly Cys Ala Phe Cys
65 70 75 80

Val Ala Gly His Thr Ala Phe Ser Ile Lys Gln Ile Gln Met Asn Asp
85 90 95

Asp Leu Ile Gln Ala Leu Arg Asn Arg Thr Pro Ile Glu Thr Asp Pro
100 105 110

Lys Leu Asp Thr Leu Ala Lys Phe Thr Leu Ala Val Ile Asn Thr Lys
115 120 125

Gly Arg Val Gly Asp Glu Ala Leu Ser Glu Phe Leu Glu Ala Gly Tyr
130 135 140

Thr Gln Gln Asn Ala Leu Asp Val Val Phe Gly Val Ser Leu Ala Ile
145 150 155 160

Leu Cys Asn Tyr Ala Asn Asn Leu Ala Asn Thr Pro Ile Asn Pro Glu
165 170 175

Leu Gln Pro Tyr Ala
180

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GGCTAAGGAA AGAGTGGATG TACTAGCTTA TAAACAGGGG TTGTTTGAAA CGAGAGAGCA 60 

GGCCAAGCGA GGTGTGATGG CTGGCCTAGT CGTAGCAGTC CTTAATGGAG AACGGTTTGA 120

CAAGCCAGGA GAGAAAATTC CAGATGACAC CGAATTAAAA CTCAAGGGGG AGAAACTCAA 180

GTATGTCAGC CGTGGTGGTT TGAAACTGGA AAAGGCCTTG CAGGTCTTTG ATTTGTCGGT 240

GGATGGCGCG ACTACGATTG ATATCGGGGC CTCTACTGGA GGTTTTACCG ATGTCATGCT 300

ACAGAATAGT GCCAAGTTGG TCTTTGCAGT CGATGTTGGT ACCAATCAGT TGGCTTGGAA 360

ATTACGCCAA GACCCACGAG TTGTCAGCAT GGAGCAGTTC AATTTCCGCT ATGCTGAAAA 420

GACTGATTTC GAGCAGGAGC CGAGCTTTGC CAGTATTGAT GTGAGTTTCA TTTCCCTTAG 480

TCTGATTTTG CCAGCCTTGC ACCGTGTCTT GGCTGATCAA GGTCAGGTGG TAGCACTTGT 540

CAAACCTCAG TTTGAGGCAG GACGTGAGCA GATTGGGAAA AATGGAATTA TTCGAGATGC 600

TAAGGTTCAT CAGAATGTCC TTGAATCTGT AACAGCTATG GCAGTAGAGG TAGGTTTTTC 660

AGTCCTTGGC TTGGACTTTT CTCCCATCCA AGGTGGACAT GGAAATATTG AATTTTTAGC 720

GTATTTGAAA AAAGAAAAGT CAGCAAGCAA TCAGATTCTT GCTGAGATTA AAGAAGCAGT 780

AGAGAGGGCG CATAGTCAAT TTAAAAATGA A 811
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Ala Lys Glu Arg Val Asp Val Leu Ala Tyr Lys Gln Gly Leu Phe Glu
1 5 10 15

Thr Arg Glu Gln Ala Lys Arg Gly Val Met Ala Gly Leu Val Val Ala
20 25 30

Val Leu Asn Gly Glu Arg Phe Asp Lys Pro Gly Glu Lys Ile Pro Asp
35 40 45

Asp Thr Glu Leu Lys Leu Lys Gly Glu Lys Leu Lys Tyr Val Ser Arg
50 55 60

Gly Gly Leu Lys Leu Glu Lys Ala Leu Gln Val Phe Asp Leu Ser Val
65 70 75 80

Asp Gly Ala Thr Thr Ile Asp Ile Gly Ala Ser Thr Gly Gly Phe Thr
85 90 95

Asp Val Met Leu Gln Asn Ser Ala Lys Leu Val Phe Ala Val Asp Val
100 105 110

Gly Thr Asn Gln Leu Ala Trp Lys Leu Arg Gln Asp Pro Arg Val Val
115 120 125

Ser Met Glu Gln Phe Asn Phe Arg Tyr Ala Glu Lys Thr Asp Phe Glu
130 135 140

Gln Glu Pro Ser Phe Ala Ser Ile Asp Val Ser Phe Ile Ser Leu Ser
145 150 155 160

Leu Ile Leu Pro Ala Leu His Arg Val Leu Ala Asp Gln Gly Gln Val
165 170 175

Val Ala Leu Val Lys Pro Gln Phe Glu Ala Gly Arg Glu Gln Ile Gly
180 185 190

Lys Asn Gly Ile Ile Arg Asp Ala Lys Val His Gln Asn Val Leu Glu
195 200 205

Ser Val Thr Ala Met Ala Val Glu Val Gly Phe Ser Val Leu Gly Leu
210 215 220

Asp Phe Ser Pro Ile Gln Gly Gly His Gly Asn Ile Glu Phe Leu Ala
225 230 235 240

Tyr Leu Lys Lys Glu Lys Ser Ala Ser Asn Gln Ile Leu Ala Glu Ile
245 250 255

Lys Glu Ala Val Glu Arg Ala His Ser Gln Phe Lys Asn Glu
260 265 270

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2290 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



TTGTTCCTAT GAACTTGGTC GTCACCAAGC TGGTCAGGTT AAGAAAGAGT CTAATCGAGT 60

TTCTTATATA GATGGTGATC AGGCTGGTCA AAAGGCAGAA AACTTGACAC CAGATGAAGT 120

CAGTAAGAGG GAGGGGATCA ACGCCGAACA AATNGTNATC AAGATTACGG ATCAAGGTTA 180

TGTGACCTCT CATGGAGACC ATTATCATTA CTATAATGGC AAGGTTCCTT ATGATGCCAT 240

CATCAGTGAA GAGCTCCTCA TGAAAGATCC GAATTATCAG TTGAAGGATT CAGACATTGT 300

CAATGAAATC AAGGGTGGTT ATGTCATTAA GGTAAACGGT AAATACTATG TNTACCTTAA 360

GGATGCAGCT CATGCGGATA ATATTCGGAC AAAAGAAGAG ATTAAACGTC AGAAGCAGGA 420

ACGCAGTCAT AATCATAACT CAAGAGCAGA TAATGCTGTT GCTGCAGCCA GAGCCCAAGG 480

ACGTTATACA ACGGATGATG GGTATATCTT CAATGCATCT GATATCATTG AGGACACGGG 540

TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC ATTCCTAAGA ATGAGTTATC 600

AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG AAGCAGGGAT CTCGTCCTTC 660

TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA TTGTCAGAGA ACCACAATCT 720

GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC ATTTCAAGCC TTTTACGTGA 780

ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT GATGGCCTTA TTTTCGACCC 840

AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC CCTCATGGTA ACCATTACCA 900

CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA ATTGCTCGTA TTATTCCCCT 960

TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA GAACAACCAA GTCCACAATC 1020

GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT CCTCAACCAG CTCCAAGCAA 1080

TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA GTAGGCGATG GTTATGTCTT 1140

TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT CTTTCAGCAG AAACAGCAGC 1200

AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT CATAAGCTAG GAGCTAAGAA 1260

AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG GCTTATGACT TACTAGCAAG 1320

AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT GATTTTGAGG CTTTGGATAA 1380

CCTGTTGGAA CGACTCAAGG ATGTCNCAAG TGATAAAGTC AAGTTAGTGG ANGATATTCT 1440

TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA AAACCAAATG CGCAAATTAC 1500

CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC AAGTACACAA CAGAAGACGG 1560

TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG GATGCCTATG TAACTCCACA 1620

TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT GAAGCTGAGA GAGCGGCAGC 1680

CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG ACAGACCATC AGGATTCAGG 1740

AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC GTGAAAGCAG CTAAGAAGGT 1800

GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA GAAGTCAAAA ACGGTAGTTT 1860

AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT GAGTGGTTTG ACGAAGGCCT 1920

TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG GCGACTGTCA AGTACTATGT 1980

CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT GGTAACGCTA GCGACCATGT 2040

TCAAAGAAAC AAAAATGGTC AAGCTGATAC CAATCAAACG GAAAAACCAA GCGAGGAGAA 2100

ACCTCAGACA GAAAAACCTG AGGAAGAAAC CCCTCGAGAA GAGAAACCGC AAAGCGAGAA 2160

ACCAGAGTCT CCAAAACCAA CAGAGGAACC AGAAGAATCA CCAGAGGAAT CAGAAGAACC 2220

TCAGGTCGAG ACTGAAAAGG TTGAAGAAAA ACTGAGAGAG GCTGAAGATT TACTTGGAAA 2280

AATCCAGGAT 2290

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15

Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30

Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45

Glu Gln Xaa Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60

Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80

Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95

Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110

Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125

Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg Ser His Asn
130 135 140

His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160

Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175

Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190

Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala Glu
195 200 205

Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220

Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240

Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255

Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270

Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285

Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr
290 295 300

Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu
305 310 315 320

Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335

Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro
340 345 350

Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365

Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
370 375 380

Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala
385 390 395 400

Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415

Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430

Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445

Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg
450 455 460

Leu Lys Asp Val Xaa Ser Asp Lys Val Lys Leu Val Xaa Asp Ile Leu
465 470 475 480

Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495

Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510

Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525

Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540

His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560

Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575

Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590

Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605

Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620

Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640

Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val
645 650 655

Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670

Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala
675 680 685

Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu
690 695 700

Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys
705 710 715 720

Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu
725 730 735

Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg
740 745 750

Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp
755 760

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 352 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
TTATAAGGGT GAATTAGAAA AAGGATACCA ATTTGATGGT TGGGAAATTT CTGGTTTCGA 60

AGGTAAAAAA GACGCTGGCT ATGTTATTAA TCTATCAAAA GATACCTTTA TAAAACCTGT 120

ATTCAAGAAA ATAGAGGAGA AAAAGGAGGA AGAAAATAAA CCTACTTTTG ATGTATCGAA 180

AAAGAAAGAT AACCCACAAG TAAACCATAG TCAATTAAAT GAAAGTCACA GAAAAGAGGA 240

TTTACAAAGA GAAGAGCATT CACAAAAATC TGATTCAACT AAGGATGTTA CAGCTACAGT 300

TCTTGATAAA AACAATATCA GTAGTAAATC AACTACTAAC AATCCTAATA AG 352
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Tyr Lys Gly Glu Leu Glu Lys Gly Tyr Gln Phe Asp Gly Trp Glu Ile
1 5 10 15

Ser Gly Phe Glu Gly Lys Lys Asp Ala Gly Tyr Val Ile Asn Leu Ser
20 25 30

Lys Asp Thr Phe Ile Lys Pro Val Phe Lys Lys Ile Glu Glu Lys Lys
35 40 45

Glu Glu Glu Asn Lys Pro Thr Phe Asp Val Ser Lys Lys Lys Asp Asn
50 55 60

Pro Gln Val Asn His Ser Gln Leu Asn Glu Ser His Arg Lys Glu Asp
65 70 75 80

Leu Gln Arg Glu Glu His Ser Gln Lys Ser Asp Ser Thr Lys Asp Val
85 90 95

Thr Ala Thr Val Leu Asp Lys Asn Asn Ile Ser Ser Lys Ser Thr Thr
100 105 110

Asn Asn Pro Asn Lys
115

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:



GAATGTTCAG GCTCAAGAAA GTTCAGGAAA TAAAATCCAC TTTATCAATG TTCAAGAAGG 60

TGGCAGTGAT GCGATTATTC TTGAAAGCAA TGGACATTTT GCCATGGTGG ATACAGGAGA 120

AGATTATGAT TTCCCAGATG GAAGTGATTC TCGCTATCCA TGGAGAGAAG GAATTGAAAC 180

GTCTTATAAG CATGTTCTAA CAGACCGTGT CTTTCGTCGT TTGAAGGAAT TGGGTGTCCA 240

AAAACTTGAT TTTATTTTGG TGACCCATAC CCACAGTGAT CATATTGGAA ATGTTGATGA 300

ATTACTGTCT ACCTATCCAG TTGACCGAGT CTATCTTAAG AAATATAGTG ATAGTCGTAT 360

TACTAATTCT GAACGTCTAT GGGATAATCT GTATGGCTAT GATAAGGTTT TACAGACTGC 420

TGCAGAAAAA GGTGTTTCAG TTATTCAAAA TATCACACAA GGGGATGCTC ATTTTCAGTT 480

TGGGGACATG GATATTCAGC TCTATAATTA TGAAAATGAA ACTGATTCAT CGGGTGAATT 540

AAAGAAAATT TGGGATGACA ATTCCAATTC CTTGATTAGC GTGGTGAAAG TCAATGGCAA 600

GAAAATTTAC CTTGGGGGCG ATTTAGATAA TGTTCATGGA GCAGAAGACA AGTATGGTCC 660

TCTCATTGGA AAAGTTGATT TGATGAAGTT TAATCATCAC CATGATACCA ACAAATCAAA 720

TACCAAGGAT TTCATTAAAA ATTTGAGTCC GAGTTTGATT GTTCAAACTT CGGATAGTCT 780

ACCTTGGAAA AATGGTGTTG ATAGTGAGTA TGTTAATTGG CTCAAAGAAC GAGGAATTGA 840

GAGAATCAAC GCAGCCAGCA AAGACTATGA TGCAACAGTT TTTGATATTC GAAAAGACGG 900

TTTTGTCAAT ATTTCAACAT CCTACAAGCC GATTCCAAGT TTTCAAGCTG GTTGGCATAA 960

GAGTGCATAT GGGAACTGGT GGTATCAAGC GCCTGATTCT ACAGGAGAGT ATGCTGTCGG 1020

TTGGAATGAA ATCGAAGGTG AATGGTATTA CTTTAACCAA ACGGGTATCT TGTTACAGAA 1080

TCAATGGAAA AAATGGAACA ATCATTGGTT CTATTTGACA GACTCTGGTG CTTCTGCTAA 1140

AAATTGGAAG AAAATCGCTG GAATCTGGTA TTATTTTAAC AAAGAAAACC AGATGGAAAT 1200

TGGTTGGATT CAAGATAAAG AGCAGTGGTA TTATTTGGAT GTTGATGGTT CTATGAAGAC 1260

AGGATGGCTT CAATATATGG GGCAATGGTA TTACTTTGCT CCATCAGGGG AA 1312


(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Asn Val Gln Ala Gln Glu Ser Ser Gly Asn Lys Ile His Phe Ile Asn
1 5 10 15

Val Gln Glu Gly Gly Ser Asp Ala Ile Ile Leu Glu Ser Asn Gly His
20 25 30

Phe Ala Met Val Asp Thr Gly Glu Asp Tyr Asp Phe Pro Asp Gly Ser
35 40 45

Asp Ser Arg Tyr Pro Trp Arg Glu Gly Ile Glu Thr Ser Tyr Lys His
50 55 60

Val Leu Thr Asp Arg Val Phe Arg Arg Leu Lys Glu Leu Gly Val Gln
65 70 75 80

Lys Leu Asp Phe Ile Leu Val Thr His Thr His Ser Asp His Ile Gly
85 90 95

Asn Val Asp Glu Leu Leu Ser Thr Tyr Pro Val Asp Arg Val Tyr Leu
100 105 110

Lys Lys Tyr Ser Asp Ser Arg Ile Thr Asn Ser Glu Arg Leu Trp Asp
115 120 125

Asn Leu Tyr Gly Tyr Asp Lys Val Leu Gln Thr Ala Ala Glu Lys Gly
130 135 140

Val Ser Val Ile Gln Asn Ile Thr Gln Gly Asp Ala His Phe Gln Phe
145 150 155 160

Gly Asp Met Asp Ile Gln Leu Tyr Asn Tyr Glu Asn Glu Thr Asp Ser
165 170 175

Ser Gly Glu Leu Lys Lys Ile Trp Asp Asp Asn Ser Asn Ser Leu Ile
180 185 190

Ser Val Val Lys Val Asn Gly Lys Lys Ile Tyr Leu Gly Gly Asp Leu
195 200 205

Asp Asn Val His Gly Ala Glu Asp Lys Tyr Gly Pro Leu Ile Gly Lys
210 215 220

Val Asp Leu Met Lys Phe Asn His His His Asp Thr Asn Lys Ser Asn
225 230 235 240

Thr Lys Asp Phe Ile Lys Asn Leu Ser Pro Ser Leu Ile Val Gln Thr
245 250 255

Ser Asp Ser Leu Pro Trp Lys Asn Gly Val Asp Ser Glu Tyr Val Asn
260 265 270

Trp Leu Lys Glu Arg Gly Ile Glu Arg Ile Asn Ala Ala Ser Lys Asp
275 280 285

Tyr Asp Ala Thr Val Phe Asp Ile Arg Lys Asp Gly Phe Val Asn Ile
290 295 300

Ser Thr Ser Tyr Lys Pro Ile Pro Ser Phe Gln Ala Gly Trp His Lys
305 310 315 320

Ser Ala Tyr Gly Asn Trp Trp Tyr Gln Ala Pro Asp Ser Thr Gly Glu
325 330 335

Tyr Ala Val Gly Trp Asn Glu Ile Glu Gly Glu Trp Tyr Tyr Phe Asn
340 345 350

Gln Thr Gly Ile Leu Leu Gln Asn Gln Trp Lys Lys Trp Asn Asn His
355 360 365

Trp Phe Tyr Leu Thr Asp Ser Gly Ala Ser Ala Lys Asn Trp Lys Lys
370 375 380

Ile Ala Gly Ile Trp Tyr Tyr Phe Asn Lys Glu Asn Gln Met Glu Ile
385 390 395 400

Gly Trp Ile Gln Asp Lys Glu Gln Trp Tyr Tyr Leu Asp Val Asp Gly
405 410 415

Ser Met Lys Thr Gly Trp Leu Gln Tyr Met Gly Gln Trp Tyr Tyr Phe
420 425 430

Ala Pro Ser Gly Glu
435

(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1855 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

CTTGGGTGTA ACCCATATCC AGCTCCTTCC AGTCTTGTCT TACTACTTTG TCAATGAATT 60 

GAAAAACCAT GAACGCTTGT CTGACTACGC TTCAAGCAAC AGCAACTACA ACTGGGGATA 120

TGACCCTCAA AACTACTTCT CCTTGACTGG TATGTACTCA AGCGATCCTA AGAATCCAGA 180

AAAACGAATC GCAGAATTTA AAAACCTCAT CAACGAAATC CACAAACGTG GTATGGGAGC 240

TATCCTAGAT GTCGTTTATA ACCACACAGC CAAAGTCGAT CTCTTTGAAG ATTTGGAACC 300

AAACTACTAC CACTTTATGG ATGCCGATGG CACACCTCGA ACTAGCTTTG GTGGTGGACG 360

CTTGGGGACA ACCCACCATA TGACCAAACG GCTCCTAATT GACTCTATCA AATACCTAGT 420

TGATACCTAC AAAGTGGATG GCTTCCGTTT CGATATGATG GGAGACCATG ACGCCGCTTC 480

TATCGAAGAA GCTTACAAGG CTGCACGCGC CCTCAATCCA AACCTCATCA TGCTTGGTGA 540

AGGTTGGAGA ACCTATGCCG GTGATGAAAA CATGCCTACT AAAGCTGCTG ACCAAGATTG 600

GATGAAACAT ACCGATACTG TCGCTGTCTT TTCAGATGAC ATCCGTAACA ACCTCAAATC 660

TGGTTATCCA AACGAAGGTC AACCTGCCTT TATCACAGGT GGCAAGCGTG ATGTCAACAC 720

CATCTTTAAA AATCTCATTG CTCAACCAAC TAACTTTGAA GCTGACAGCC CTGGAGATGT 780

CATCCAATAC ATCGCAGCCC ATGATAACTT GACCCTCTTT GACATCATTG CCCAGTCTAT 840

CAAAAAAGAC CCAAGCAAGG CTGAGAACTA TGCTGAAATC CACCGTCGTT TACGACTTGG 900

AAATCTCATG GTCTTGACAG CTCAAGGAAC TCCATTTATC CACTCCGGTC AGGAATATGG 960

ACGTACTAAA CAATTCCGTG ACCCAGCCTA CAAGACTCCA GTAGCAGAGG ATAAGGTTCC 1020

AAACAAATCT CACTTGTTGC GTGATAAGGA CGGCAACCCA TTTGACTATC CTTACTTCAT 1080

CCATGACTCT TACGATTCTA GTGATGCAGT CAACAAGTTT GACTGGACTA AGGCTACAGA 1140

TGGTAAAGCT TATCCTGAAA ATGTCAAGAG CCGTGACTAT ATGAAAGGTT TGATTGCCCT 1200

TCGTCAATCT ACAGATGCCT TCCGACTTAA GAGTCTTCAA GATATCAAAG ACCGTGTCCA 1260

CCTCATCACT GTCCCAGGCC AAAATGGTGT GGAAAAAGAG GATGTAGTGA TTGGCTACCA 1320

AATCACTGCT CCAAACGGCG ATATCTACGC AGTCTTTGTC AATGCGGATG AAAAAGCTCG 1380

CGAATTTAAT TTGGGAACTG CCTTTGCACA TCTAAGAAAT GCGGAAGTTT TGGCAGATGA 1440

AAACCAAGCA GGACCAGTCG GAATTGCCAA CCCGAAAGGA CTTGAATGGA CTGAAAAAGG 1500

CTTGAAATTG AATGCCCTTA CAGCTACTGT TCTTCGAGTC TCTCAAAATG GAACTAGCCA 1560

TGAGTCAACT GCAGAAGAGA AACCAGACTC AACCCCTTCC AAGCCTGAAC ATCAAAATGA 1620

AGCTTCTCAC CCTGCACATC AAGACCCAGC TCCAGAAGCT AGACCTGATT CTACTAAACC 1680

AGATGCCAAA GTAGCTGATG CGGAAAATAA ACCTAGCCAA GCTACAGCTG ATTCACAAGC 1740

TGAACAACCA GCACAAGAAG CACAAGCATC ATCTGTAAAA GAAGCGGTTC GAAACGAATC 1800

GGTAGAAAAC TCTAGCAAGG AAAATATACC TGCAACCCCA GATAAACAAG CTGAA 1855

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Leu Gly Val Thr His Ile Gln Leu Leu Pro Val Leu Ser Tyr Tyr Phe
1 5 10 15

Val Asn Glu Leu Lys Asn His Glu Arg Leu Ser Asp Tyr Ala Ser Ser
20 25 30

Asn Ser Asn Tyr Asn Trp Gly Tyr Asp Pro Gln Asn Tyr Phe Ser Leu
35 40 45

Thr Gly Met Tyr Ser Ser Asp Pro Lys Asn Pro Glu Lys Arg Ile Ala
50 55 60

Glu Phe Lys Asn Leu Ile Asn Glu Ile His Lys Arg Gly Met Gly Ala
65 70 75 80

Ile Leu Asp Val Val Tyr Asn His Thr Ala Lys Val Asp Leu Phe Glu
85 90 95

Asp Leu Glu Pro Asn Tyr Tyr His Phe Met Asp Ala Asp Gly Thr Pro
100 105 110

Arg Thr Ser Phe Gly Gly Gly Arg Leu Gly Thr Thr His His Met Thr
115 120 125

Lys Arg Leu Leu Ile Asp Ser Ile Lys Tyr Leu Val Asp Thr Tyr Lys
130 135 140

Val Asp Gly Phe Arg Phe Asp Met Met Gly Asp His Asp Ala Ala Ser
145 150 155 160

Ile Glu Glu Ala Tyr Lys Ala Ala Arg Ala Leu Asn Pro Asn Leu Ile
165 170 175

Met Leu Gly Glu Gly Trp Arg Thr Tyr Ala Gly Asp Glu Asn Met Pro
180 185 190

Thr Lys Ala Ala Asp Gln Asp Trp Met Lys His Thr Asp Thr Val Ala
195 200 205

Val Phe Ser Asp Asp Ile Arg Asn Asn Leu Lys Ser Gly Tyr Pro Asn
210 215 220

Glu Gly Gln Pro Ala Phe Ile Thr Gly Gly Lys Arg Asp Val Asn Thr
225 230 235 240

Ile Phe Lys Asn Leu Ile Ala Gln Pro Thr Asn Phe Glu Ala Asp Ser
245 250 255

Pro Gly Asp Val Ile Gln Tyr Ile Ala Ala His Asp Asn Leu Thr Leu
260 265 270

Phe Asp Ile Ile Ala Gln Ser Ile Lys Lys Asp Pro Ser Lys Ala Glu
275 280 285

Asn Tyr Ala Glu Ile His Arg Arg Leu Arg Leu Gly Asn Leu Met Val
290 295 300

Leu Thr Ala Gln Gly Thr Pro Phe Ile His Ser Gly Gln Glu Tyr Gly
305 310 315 320

Arg Thr Lys Gln Phe Arg Asp Pro Ala Tyr Lys Thr Pro Val Ala Glu
325 330 335

Asp Lys Val Pro Asn Lys Ser His Leu Leu Arg Asp Lys Asp Gly Asn
340 345 350

Pro Phe Asp Tyr Pro Tyr Phe Ile His Asp Ser Tyr Asp Ser Ser Asp
355 360 365

Ala Val Asn Lys Phe Asp Trp Thr Lys Ala Thr Asp Gly Lys Ala Tyr
370 375 380

Pro Glu Asn Val Lys Ser Arg Asp Tyr Met Lys Gly Leu Ile Ala Leu
385 390 395 400

Arg Gln Ser Thr Asp Ala Phe Arg Leu Lys Ser Leu Gln Asp Ile Lys
405 410 415

Asp Arg Val His Leu Ile Thr Val Pro Gly Gln Asn Gly Val Glu Lys
420 425 430

Glu Asp Val Val Ile Gly Tyr Gln Ile Thr Ala Pro Asn Gly Asp Ile
435 440 445

Tyr Ala Val Phe Val Asn Ala Asp Glu Lys Ala Arg Glu Phe Asn Leu
450 455 460

Gly Thr Ala Phe Ala His Leu Arg Asn Ala Glu Val Leu Ala Asp Glu
465 470 475 480

Asn Gln Ala Gly Pro Val Gly Ile Ala Asn Pro Lys Gly Leu Glu Trp
485 490 495

Thr Glu Lys Gly Leu Lys Leu Asn Ala Leu Thr Ala Thr Val Leu Arg
500 505 510

Val Ser Gln Asn Gly Thr Ser His Glu Ser Thr Ala Glu Glu Lys Pro
515 520 525

Asp Ser Thr Pro Ser Lys Pro Glu His Gln Asn Glu Ala Ser His Pro
530 535 540

Ala His Gln Asp Pro Ala Pro Glu Ala Arg Pro Asp Ser Thr Lys Pro
545 550 555 560

Asp Ala Lys Val Ala Asp Ala Glu Asn Lys Pro Ser Gln Ala Thr Ala
565 570 575

Asp Ser Gln Ala Glu Gln Pro Ala Gln Glu Ala Gln Ala Ser Ser Val
580 585 590

Lys Glu Ala Val Arg Asn Glu Ser Val Glu Asn Ser Ser Lys Glu Asn
595 600 605

Ile Pro Ala Thr Pro Asp Lys Gln Ala Glu
610 615

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1774 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

TAGTGATGGT ACTTGGCAAG GAAAACAGTA TCTGAAAGAA GATGGCAGTC AAGCAGCAAA 60 

TGAGTGGGTT TTNGATACTC ATTATCAATC TTGGTTCTAT ATAAAAGCAG ATGCTAACTA 120

TGCTGAAAAT GAATGGCTAA AGCAAGGTGA CGACTATTTT TACCTCAAAT CTGGTGGCTA 180

TATGGCCAAA TCAGAATGGG TAGAAGACAA GGGAGCCTTT TATTATCTTG ACCAAGATGG 240

AAAGATGAAA AGAAATGCTT GGGTAGGAAC TTCCTATGTT GGTGCAACAG GTGCCAAAGT 300

AATAGAAGAC TGGGTCTATG ATTCTCAATA CGATGCTTGG TTTTATATCA AAGCAGATGG 360

ACAGCACGCA GAGAAAGAAT GGCTCCAAAT TAAAGGGAAG GACTATTATT TCAAATCCGG 420

TGGTTATCTA CTGACAAGTC AGTGGATTAA TCAAGCTTAT GTGAATGCTA GTGGTGCCAA 480

AGTACAGCAA GGTTGGCTTT TTGACAAACA ATACCAATCT TGGTTTTACA TCAAAGAAAA 540

TGGAAACTAT GCTGATAAAG AATGGATTTT CGAGAATGGT CACTATTATT ATCTAAAATC 600

CGGTGGCTAC ATGGCAGCCA ATGAATGGAT TTGGGATAAG GAATCTTGGT TTTATCTCAA 660

ATTTGATGGG AAAATGGCTG AAAAAGAATG GGTCTACGAT TCTCATAGTC AAGCTTGGTA 720

CTACTTCAAA TCCGGTGGTT ACATGACAGC CAATGAATGG ATTTGGGATA AGGAATCTTG 780

GTTTTACCTC AAATCTGATG GGAAAATAGC TGAAAAAGAA TGGGTCTACG ATTCTCATAG 840

TCAAGCTTGG TACTACTTCA AATCTGGTGG CTACATGGCG AAAAATGAGA CAGTAGATGG 900

TTATCAGCTT GGAAGCGATG GTAAATGGCT TGGAGGAAAA ACTACAAATG AAAATGCTGC 960

TTACTATCAA GTAGTGCCTG TTACAGCCAA TGTTTATGAT TCAGATGGTG AAAAGCTTTC 1020

CTATATATCG CAAGGTAGTG TCGTATGGCT AGATAAGGAT AGAAAAAGTG ATGACAAGCG 1080

CTTGGCTATT ACTATTTCTG GTTTGTCAGG CTATATGAAA ACAGAAGATT TACAAGCGCT 1140

AGATGCTAGT AAGGACTTTA TCCCTTATTA TGAGAGTGAT GGCCACCGTT TTTATCACTA 1200

TGTGGCTCAG AATGCTAGTA TCCCAGTAGC TTCTCATCTT TCTGATATGG AAGTAGGCAA 1260

GAAATATTAT TCGGCAGATG GCCTGCATTT TGATGGTTTT AAGCTTGAGA ATCCCTTCCT 1320

TTTCAAAGAT TTAACAGAGG CTACAAACTA CAGTGCTGAA GAATTGGATA AGGTATTTAG 1380

TTTGCTAAAC ATTAACAATA GCCTTTTGGA GAACAAGGGC GCTACTTTTA AGGAAGCCGA 1440

AGAACATTAC CATATCAATG CTCTTTATCT CCTTGCCCAT AGTGCCCTAG AAAGTAACTG 1500

GGGAAGAAGT AAAATTGCCA AAGATAAGAA TAATTTCTTT GGCATTACAG CCTATGATAC 1560

GACCCCTTAC CTTTCTGCTA AGACATTTGA TGATGTGGAT AAGGGAATTT TAGGTGCAAC 1620

CAAGTGGATT AAGGAAAATT ATATCGATAG GGGAAGAACT TTCCTTGGAA ACAAGGCTTC 1680

TGGTATGAAT GTGGAATATG CTTCAGACCC TTATTGGGGC GAAAAAATTG CTAGTGTGAT 1740

GATGAAAATC AATGAGAAGC TAGGTGGCAA AGAT 1774

(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ser Asp Gly Thr Trp Gln Gly Lys Gln Tyr Leu Lys Glu Asp Gly Ser
1 5 10 15

Gln Ala Ala Asn Glu Trp Val Xaa Asp Thr His Tyr Gln Ser Trp Phe
20 25 30

Tyr Ile Lys Ala Asp Ala Asn Tyr Ala Glu Asn Glu Trp Leu Lys Gln
35 40 45

Gly Asp Asp Tyr Phe Tyr Leu Lys Ser Gly Gly Tyr Met Ala Lys Ser
50 55 60

Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp Gln Asp Gly
65 70 75 80

Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val Gly Ala Thr
85 90 95

Gly Ala Lys Val Ile Glu Asp Trp Val Tyr Asp Ser Gln Tyr Asp Ala
100 105 110

Trp Phe Tyr Ile Lys Ala Asp Gly Gln His Ala Glu Lys Glu Trp Leu
115 120 125

Gln Ile Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly Tyr Leu Leu
130 135 140

Thr Ser Gln Trp Ile Asn Gln Ala Tyr Val Asn Ala Ser Gly Ala Lys
145 150 155 160

Val Gln Gln Gly Trp Leu Phe Asp Lys Gln Tyr Gln Ser Trp Phe Tyr
165 170 175

Ile Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp Ile Phe Glu Asn
180 185 190

Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr Met Ala Ala Asn Glu
195 200 205

Trp Ile Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe Asp Gly Lys
210 215 220

Met Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gln Ala Trp Tyr
225 230 235 240

Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp Ile Trp Asp
245 250 255

Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys Ile Ala Glu Lys
260 265 270

Glu Trp Val Tyr Asp Ser His Ser Gln Ala Trp Tyr Tyr Phe Lys Ser
275 280 285

Gly Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gly Tyr Gln Leu Gly
290 295 300

Ser Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala
305 310 315 320

Tyr Tyr Gln Val Val Pro Val Thr Ala Asn Val Tyr Asp Ser Asp Gly
325 330 335

Glu Lys Leu Ser Tyr Ile Ser Gln Gly Ser Val Val Trp Leu Asp Lys
340 345 350

Asp Arg Lys Ser Asp Asp Lys Arg Leu Ala Ile Thr Ile Ser Gly Leu
355 360 365

Ser Gly Tyr Met Lys Thr Glu Asp Leu Gln Ala Leu Asp Ala Ser Lys
370 375 380

Asp Phe Ile Pro Tyr Tyr Glu Ser Asp Gly His Arg Phe Tyr His Tyr
385 390 395 400

Val Ala Gln Asn Ala Ser Ile Pro Val Ala Ser His Leu Ser Asp Met
405 410 415

Glu Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly
420 425 430

Phe Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala Thr
435 440 445

Asn Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn Ile
450 455 460

Asn Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu
465 470 475 480

Glu His Tyr His Ile Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu
485 490 495

Glu Ser Asn Trp Gly Arg Ser Lys Ile Ala Lys Asp Lys Asn Asn Phe
500 505 510

Phe Gly Ile Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr
515 520 525

Phe Asp Asp Val Asp Lys Gly Ile Leu Gly Ala Thr Lys Trp Ile Lys
530 535 540

Glu Asn Tyr Ile Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser
545 550 555 560

Gly Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys Ile
565 570 575

Ala Ser Val Met Met Lys Ile Asn Glu Lys Leu Gly Gly Lys Asp
580 585 590

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

TGGGATTCAA TATGTCAGAG ATGATACTAG AGATAAAGAA GAGGGAATAG AGTATGATGA 60 

CGCTGACAAT GGGGATATTA TTGTAAAAGT AGCGACTAAA CCTAAGGTAG TAACCAAGAA 120

AATTTCAAGT ACGCGAATTC GTTATGAAAA AGATGAAACA AAAGACCGTA GTGAAAATCC 180

TGTTACAATT GATGGAGAGG ATGGCTATGT AACTACGACA AGGACCTACG ATGTTAATCC 240

AGAGACTGGT TATGTTACCG AACAGGTTAC TGTTGATAGA AAAGAAGCCA CGGATACAGT 300

TATCAAAGTT CCAGCTAAAA GCAAGGTTGA AGAAGTTCTT GTTCCATTTG CTACTAAATA 360

TGAAGCAGAC AATGACCTTT CTGCAGGACA GGAGCAAGAG ATTACTCTAG GAAAGAATGG 420

GAAAACAGTT ACAACGATAA CTTATAATGT AGATGGAAAG AGTGGACAAG TAACTGAGAG 480

TACTTTAAGT CAAAAAAAAG ACTCtCAAAC AAGAGTTGTT AAAAAAAGaA CCArkCCCCA 540

AGTTCTTGTC CAAGAAATTC CAATCGAAAC AGAATATCTC GATGGCCCaA CTCTTGATAA 600

AaGTCAAGAA GTAGAAGAAG TAGGAGAAAT TGGTAAATTA CTCTTACTAC AATCTATACT 660

GGTAGATGAA CGTGATGGAA CAATTGAAGA AACTACTTCT CGTCAAATTA CTAAAGAGAT 720

GGTAAAAAGA CGTATAAGGA GAGGGACGAG AGAACCTGAA AAAGTTGTTG TTCCTGAGCA 780

ATCATCTATT CCTTCGTATC CTGTATCTGT TACATCTAAC CAAGGAACAG ATGTAGCAGT 840

AGAACCAGCT AAAGCAGTTG CTCCAACAAC AGACTGGAAA CAAGAAAATG GTATGTGGTA 900

TTTTTATAAT ACTGATGGTT CCATGGCAAC AGGTTGGGTA CAAGTTAATA GTTCATGGTA 960

CTACCTCAAC AGCAACGGTT CTATGAAAGT CAATCAATGG TTCCAAGTTG GTGGTAAATG 1020

GTATTATGTA AATACATCGG GTGAGTTAGC GGTCAATACA AGTATAGATG GCTATAGAGT 1080

CAATGATAAT GGTGAATGGG TGCGT 1105

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Ile Gln Tyr Val Arg Asp Asp Thr Arg Asp Lys Glu Glu Gly Ile
1 5 10 15

Glu Tyr Asp Asp Ala Asp Asn Gly Asp Ile Ile Val Lys Val Ala Thr
20 25 30

Lys Pro Lys Val Val Thr Lys Lys Ile Ser Ser Thr Arg Ile Arg Tyr
35 40 45

Glu Lys Asp Glu Thr Lys Asp Arg Ser Glu Asn Pro Val Thr Ile Asp
50 55 60

Gly Glu Asp Gly Tyr Val Thr Thr Thr Arg Thr Tyr Asp Val Asn Pro
65 70 75 80

Glu Thr Gly Tyr Val Thr Glu Gln Val Thr Val Asp Arg Lys Glu Ala
85 90 95

Thr Asp Thr Val Ile Lys Val Pro Ala Lys Ser Lys Val Glu Glu Val
100 105 110

Leu Val Pro Phe Ala Thr Lys Tyr Glu Ala Asp Asn Asp Leu Ser Ala
115 120 125

Gly Gln Glu Gln Glu Ile Thr Leu Gly Lys Asn Gly Lys Thr Val Thr
130 135 140

Thr Ile Thr Tyr Asn Val Asp Gly Lys Ser Gly Gln Val Thr Glu Ser
145 150 155 160

Thr Leu Ser Gln Lys Lys Asp Ser Gln Thr Arg Val Val Lys Lys Arg
165 170 175

Thr Xaa Pro Gln Val Leu Val Gln Glu Ile Pro Ile Glu Thr Glu Tyr
180 185 190

Leu Asp Gly Pro Thr Leu Asp Lys Ser Gln Glu Val Glu Glu Val Gly
195 200 205

Glu Ile Gly Lys Leu Leu Leu Leu Gln Ser Ile Leu Val Asp Glu Arg
210 215 220

Asp Gly Thr Ile Glu Glu Thr Thr Ser Arg Gln Ile Thr Lys Glu Met
225 230 235 240

Val Lys Arg Arg Ile Arg Arg Gly Thr Arg Glu Pro Glu Lys Val Val
245 250 255

Val Pro Glu Gln Ser Ser Ile Pro Ser Tyr Pro Val Ser Val Thr Ser
260 265 270

Asn Gln Gly Thr Asp Val Ala Val Glu Pro Ala Lys Ala Val Ala Pro
275 280 285

Thr Thr Asp Trp Lys Gln Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr
290 295 300

Asp Gly Ser Met Ala Thr Gly Trp Val Gln Val Asn Ser Ser Trp Tyr
305 310 315 320

Tyr Leu Asn Ser Asn Gly Ser Met Lys Val Asn Gln Trp Phe Gln Val
325 330 335

Gly Gly Lys Trp Tyr Tyr Val Asn Thr Ser Gly Glu Leu Ala Val Asn
340 345 350

Thr Ser Ile Asp Gly Tyr Arg Val Asn Asp Asn Gly Glu Trp Val Arg
355 360 365

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

GGATAATAGA GAAGCATTAA AAACCTTTAT GACGGGTGAA AATTTTTATC TCCAACATTA 60

TCTAGGAGCA CATAGGGAAG AACTAAATGG AGAGCATGGC TATACCTTCC GTGTTTGGGC 120

ACCTAATGCT CAGGCTGTTC ACTTGGTTGG TGATTTTACC AACTGGATTG AAAATCAGAT 180

TCCAATGGTA AGAAATGATT TTGGGGTCTG GGAAGTCTTT ACCAATATGG CTCAAGAAGG 240

GCATATTTAC AAATATCATG TCACACGTCA AAATGGTCAT CAACTGATGA AGATTGACCC 300

TTTTGCTGTC AGGTATGAGG CTCGTCCAGG AACAGGGGCA ATCGTAACAG AGCTTCCTGA 360

GAAGAAATGG AAGGATGGAC TTTGGCTGGC ACGAAGAAAA CGTTGGGGCT TTGAAGAGCG 420

TCCTGTCAAT ATTTATGAAG TTCACGCTGG ATCATGGAAA AGAAATTCTG ATGGCAGTCC 480

TTATAGTTTT GCCCAGCTCA AGGATGAACT CATTCCTTAT CTCGTTGAAA TGAACTATAC 540

TCATATTGAG TTTATGCCCT TGATGTCCCA TCCTTTGGGC TTGAGTTGGG GGTATCAGCT 600

TATGGGTTAC TTCGCTTTAG AGCATGCTTA TGGCCGACCA GAGGAGTTTC AAGATTTTGT 660

C 661

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

Asp Asn Arg Glu Ala Leu Lys Thr Phe Met Thr Gly Glu Asn Phe Tyr 
1 5 10 15 

Leu Gln His Tyr Leu Gly Ala His Arg Glu Glu Leu Asn Gly Glu His
20 25 30

Gly Tyr Thr Phe Arg Val Trp Ala Pro Asn Ala Gln Ala Val His Leu
35 40 45

Val Gly Asp Phe Thr Asn Trp Ile Glu Asn Gln Ile Pro Met Val Arg
50 55 60

Asn Asp Phe Gly Val Trp Glu Val Phe Thr Asn Met Ala Gln Glu Gly
65 70 75 80

His Ile Tyr Lys Tyr His Val Thr Arg Gln Asn Gly His Gln Leu Met
85 90 95

Lys Ile Asp Pro Phe Ala Val Arg Tyr Glu Ala Arg Pro Gly Thr Gly
100 105 110

Ala Ile Val Thr Glu Leu Pro Glu Lys Lys Trp Lys Asp Gly Leu Trp
115 120 125

Leu Ala Arg Arg Lys Arg Trp Gly Phe Glu Glu Arg Pro Val Asn Ile
130 135 140

Tyr Glu Val His Ala Gly Ser Trp Lys Arg Asn Ser Asp Gly Ser Pro
145 150 155 160

Tyr Ser Phe Ala Gln Leu Lys Asp Glu Leu Ile Pro Tyr Leu Val Glu
165 170 175

Met Asn Tyr Thr His Ile Glu Phe Met Pro Leu Met Ser His Pro Leu
180 185 190

Gly Leu Ser Trp Gly Tyr Gln Leu Met Gly Tyr Phe Ala Leu Glu His
195 200 205

Ala Tyr Gly Arg Pro Glu Glu Phe Gln Asp Phe Val
210 215 220

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 976 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



AGATTTTGTC GAGGAGTGTC ATACCCATAA TATTGGGGTT ATTGTGGACT GGGTACCAGN 60

TCACTTTACC ATCAACGATG ATGCCTTAGC CTATTATGAT GGGACACCGA CTTTTGAATA 120

CCAAGACCAT AATAAGGCTC ATAACCATGG TTGGGGTGCC CTTAATTTTG ACCTTGGAAA 180

AAATGAAGTC CAGTCCTTCT TAATTTCTTG CATTAAGCAT TGGATTGATG TCTATCATTT 240

GGATGGTATT CGTGTGGATG CTGTTAGCAA CATGCTCTAT TTGGACTATG ATGATGCTCC 300

ATGGACACCT AATAAAGATG GCGGAAATCT CAACTATGAA GGTTATTATT TCCTTCAGCG 360

CTTGAATGAG GTTATTAAGT TAGAATATCC AGATGTGATG ATGATTGCAG AAGAAAGTTC 420

GTCTGCGATC AAGATTACGG GAATGAAAGA GATTGGTGGT CTAGGATTTG ACTACAAATG 480

GAACATGGGC TGGATGAATG ATATCCTCCG TTTCTACGAA GAAGATCCGA TCTATCGTAA 540

ATATGACTTT AACCTGGTGA CTTTCAGCTT TATGTATGTT TNCAAGGAGA ATTATCTCTT 600

GCCATTCTCG CACGATGAAG TGGTTCATGG CAAGAAGAGT ATGATGCATA AGATGTGGGG 660

AGATCGTTAC AATCAATTCG CAGGCTTGCG CAATCTCTAT ACGTACCAAA TTTGTCACCC 720

TGGTAAGAAA TTGCTCTTCA TGGGTAGCGA ATACGGTCAA TTCCTAGAAT GGAAATCTGA 780

AGAACAGTTG GAATGGTCTA ACCTAGAAGA CCCAATGAAT GCTAAGATGA AGTATTTCGC 840

TTCTCAGCTA AACCAGTTTT ACAAAGATCA TCGCTGTCTG TGGGAAATTG ATACCAGCTA 900

TGATGGTATT GAAATCATTG ATGCGGATAA TCGAGACCAG AGTGTTCTTT CCTTTATTCG 960

TAAGGGTAAA AAGGGA 976



(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Asp Phe Val Glu Glu Cys His Thr His Asn Ile Gly Val Ile Val Asp
1 5 10 15

Trp Val Pro Xaa His Phe Thr Ile Asn Asp Asp Ala Leu Ala Tyr Tyr
20 25 30

Asp Gly Thr Pro Thr Phe Glu Tyr Gln Asp His Asn Lys Ala His Asn
35 40 45

His Gly Trp Gly Ala Leu Asn Phe Asp Leu Gly Lys Asn Glu Val Gln
50 55 60

Ser Phe Leu Ile Ser Cys Ile Lys His Trp Ile Asp Val Tyr His Leu
65 70 75 80

Asp Gly Ile Arg Val Asp Ala Val Ser Asn Met Leu Tyr Leu Asp Tyr
85 90 95

Asp Asp Ala Pro Trp Thr Pro Asn Lys Asp Gly Gly Asn Leu Asn Tyr
100 105 110

Glu Gly Tyr Tyr Phe Leu Gln Arg Leu Asn Glu Val Ile Lys Leu Glu
115 120 125

Tyr Pro Asp Val Met Met Ile Ala Glu Glu Ser Ser Ser Ala Ile Lys
130 135 140

Ile Thr Gly Met Lys Glu Ile Gly Gly Leu Gly Phe Asp Tyr Lys Trp
145 150 155 160

Asn Met Gly Trp Met Asn Asp Ile Leu Arg Phe Tyr Glu Glu Asp Pro
165 170 175

Ile Tyr Arg Lys Tyr Asp Phe Asn Leu Val Thr Phe Ser Phe Met Tyr
180 185 190

Val Xaa Lys Glu Asn Tyr Leu Leu Pro Phe Ser His Asp Glu Val Val
195 200 205

His Gly Lys Lys Ser Met Met His Lys Met Trp Gly Asp Arg Tyr Asn
210 215 220

Gln Phe Ala Gly Leu Arg Asn Leu Tyr Thr Tyr Gln Ile Cys His Pro
225 230 235 240

Gly Lys Lys Leu Leu Phe Met Gly Ser Glu Tyr Gly Gln Phe Leu Glu
245 250 255

Trp Lys Ser Glu Glu Gln Leu Glu Trp Ser Asn Leu Glu Asp Pro Met
260 265 270

Asn Ala Lys Met Lys Tyr Phe Ala Ser Gln Leu Asn Gln Phe Tyr Lys
275 280 285

Asp His Arg Cys Leu Trp Glu Ile Asp Thr Ser Tyr Asp Gly Ile Glu
290 295 300

Ile Ile Asp Ala Asp Asn Arg Asp Gln Ser Val Leu Ser Phe Ile Arg
305 310 315 320

Lys Gly Lys Lys Gly 
325 

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2134 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

ATCTGTAGTT TATGCGGATG AAACACTTAT TACTCATACT GCTGAGAAAC CTAAAGAGGA 60 

AAAAATGATA GTAGAAGAAA AGGCTGATAA AGCTTTGGAA ACTAAAAATA TAGTTGAAAG 120 

GACAGAACAA AGTGAACCTA GTTCAACTGA GGCTATTGCA TCTGAGNAGA AAGAAGATGA 180 

AGCCGTAACT CCAAAAGAGG AAAAAGTGTC TGCTAAACCG GAAGAAAAAG CTCCAAGGAT 240 

AGAATCACAA GCTTCAAATC AAGAAAAACC GCTCAAGGAA GATGCTAAAG CTGTAACAAA 300

TGAAGAAGTG AATCAAATGA TTGAAGACAG GAAAGTGGAT TTTAATCAAA ATTGGTACTT 360

TAAACTCAAT GCAAATTCTA AGGAAGCCAT TAAACCTGAT GCAGACGTAT CTACGTGGAA 420

AAAATTAGAT TTACCGTATG ACTGGAGTAT CTTTAACGAT TTCGATCATG AATCTCCTGC 480

ACAAAATGAA GGTGGACAGC TCAACGGTGG GGAAGCTTGG TATCGCAAGA CTTTCAAACT 540

AGATGAAAAA GACCTCAAGA AAAATGTTCG CCTTACTTTT GATGGCGTCT ACATGGATTC 600

TCAAGTTTAT GTCAATGGTC AGTTAGTGGG GCATTATCCA AATGGTTATA ACCAGTTCTC 660

ATATGATATC ACCAAATACC TTCAAAAAGA TGGTCGTGAG AATGTGATTG CTGTCCATGC 720

AGTCAACAAA CAGCCAAGTA GCCGTTGGTA TTCAGGAAGT GGTATCTATC GTGATGTGAC 780

TTTACAAGTG ACAGATAAGG TGCATGTTGA GAAAAATGGG ACAACTATTT TAACACCAAA 840

ACTTGAAGAA CAACAACATG GCAAGGTTGA AACTCATGTG ACCAGCAAAA TCGTCAATAC 900

GGACGACAAA GACCATGAAC TTGTAGCCGA ATATCAAATC GTTGAACGAG GTGGTCATGC 960

TGTAACAGGC TTAGTTCGTA CAGCGAGTCG TACCTTAAAA GCACATGAAT CAACAAGCCT 1020

AGATGCGATT TTAGAAGTTG AAAGACCAAA ACTCTGGACT GTTTTAAATG ACAAACCTGC 1080

CTTGTACGAA TTGATTACGC GTGTTTACCG TGACGGTCAA TTGGTTGATG CTAAGAAGGA 1140

TTTGTTTGGT TACCGTTACT ATCACTGGAC TCCAAATGAA GGTTTCTCTT TGAATGGTGA 1200

ACGTATTAAA TTCCATGGAG TATCCTTGCA CCACGACCAT GGGGCGCTTG GAGCAGAAGA 1260

AAACTATAAA GCAGAATATC GCCGTCTCAA ACAAATGAAG GAGATGGGAG TTAACTCCAT 1320

CCGTACAACC CACAACCCTG CTAGTGAGCA AACCTTGCAA ATCGCAGCAG AACTAGGTTT 1380

ACTCGTTCAG GAAGAGGCCT TTGATACGTG GTATGGTGGC AAGAAACCTT ATGACTATGG 1440

ACGTTTCTTT GAAAAAGATG CCACTCACCC AGAAGCTCGA AAAGGTGAAA AATGGTCTGA 1500

TTTTGACCTA CGTACCATGG TCGAAAGAGG CAAAAACAAC CCTGCTATCT TCATGTGGTC 1560

AATTGGTAAT GAAATAGGTG AAGCTAATGG TGATGCCCAC TCTTTAGCAA CTGTTAAACG 1620

TTTGGTTAAG GTTATCAAGG ATGTTGATAA GACTCGCTAT GTTACCATGG GAGCAGATAA 1680

ATTCCGTTTC GGTAATGGTA GCGGAGGGCA TGAGAAAATT GCTGATGAAC TCGATGCTGT 1740

TGGATTTAAC TATTCTGAAG ATAATTACAA AGCCCTTAGA GCTAAGCATC CAAAATGGTT 1800

GATTTATGGA TCAGAAACAT CTTCAGCTAC CCGTACACGT GGAAGTTACT ATCGCCCTGA 1860

ACGTGAATTG AAACATAGCA ATGGACCTGA GCGTAATTAT GAACAGTCAG ATTATGGAAA 1920

TGATCGTGTG GGTTGGGGGA AAACAGCAAC CGCTTCATGG ACTTTTGACC GTGACAACGC 1980

TGGCTATGCT GGACAGTTTA TCTGGACAGG TACGGACTAT ATTGGTGAAC CTACACCATG 2040

GCACAACCAA AATCAAACTC CTGTTAAGAG CTCTTACTTT GGTATCGTAG ATACAGCCGG 2100

CATTCCAAAA CATGACTTCT ATCTCTACCA AAGC 2134

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 711 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Ser Val Val Tyr Ala Asp Glu Thr Leu Ile Thr His Thr Ala Glu Lys
1 5 10 15

Pro Lys Glu Glu Lys Met Ile Val Glu Glu Lys Ala Asp Lys Ala Leu
20 25 30

Glu Thr Lys Asn Ile Val Glu Arg Thr Glu Gln Ser Glu Pro Ser Ser
35 40 45

Thr Glu Ala Ile Ala Ser Glu Xaa Lys Glu Asp Glu Ala Val Thr Pro
50 55 60

Lys Glu Glu Lys Val Ser Ala Lys Pro Glu Glu Lys Ala Pro Arg Ile
65 70 75 80

Glu Ser Gln Ala Ser Asn Gln Glu Lys Pro Leu Lys Glu Asp Ala Lys
85 90 95

Ala Val Thr Asn Glu Glu Val Asn Gln Met Ile Glu Asp Arg Lys Val
100 105 110

Asp Phe Asn Gln Asn Trp Tyr Phe Lys Leu Asn Ala Asn Ser Lys Glu
115 120 125

Ala Ile Lys Pro Asp Ala Asp Val Ser Thr Trp Lys Lys Leu Asp Leu
130 135 140

Pro Tyr Asp Trp Ser Ile Phe Asn Asp Phe Asp His Glu Ser Pro Ala
145 150 155 160

Gln Asn Glu Gly Gly Gln Leu Asn Gly Gly Glu Ala Trp Tyr Arg Lys
165 170 175

Thr Phe Lys Leu Asp Glu Lys Asp Leu Lys Lys Asn Val Arg Leu Thr
180 185 190

Phe Asp Gly Val Tyr Met Asp Ser Gln Val Tyr Val Asn Gly Gln Leu
195 200 205

Val Gly His Tyr Pro Asn Gly Tyr Asn Gln Phe Ser Tyr Asp Ile Thr
210 215 220

Lys Tyr Leu Gln Lys Asp Gly Arg Glu Asn Val Ile Ala Val His Ala
225 230 235 240

Val Asn Lys Gln Pro Ser Ser Arg Trp Tyr Ser Gly Ser Gly Ile Tyr
245 250 255

Arg Asp Val Thr Leu Gln Val Thr Asp Lys Val His Val Glu Lys Asn
260 265 270

Gly Thr Thr Ile Leu Thr Pro Lys Leu Glu Glu Gln Gln His Gly Lys
275 280 285

Val Glu Thr His Val Thr Ser Lys Ile Val Asn Thr Asp Asp Lys Asp
290 295 300

His Glu Leu Val Ala Glu Tyr Gln Ile Val Glu Arg Gly Gly His Ala
305 310 315 320

Val Thr Gly Leu Val Arg Thr Ala Ser Arg Thr Leu Lys Ala His Glu
325 330 335

Ser Thr Ser Leu Asp Ala Ile Leu Glu Val Glu Arg Pro Lys Leu Trp
340 345 350

Thr Val Leu Asn Asp Lys Pro Ala Leu Tyr Glu Leu Ile Thr Arg Val
355 360 365

Tyr Arg Asp Gly Gln Leu Val Asp Ala Lys Lys Asp Leu Phe Gly Tyr
370 375 380

Arg Tyr Tyr His Trp Thr Pro Asn Glu Gly Phe Ser Leu Asn Gly Glu
385 390 395 400

Arg Ile Lys Phe His Gly Val Ser Leu His His Asp His Gly Ala Leu
405 410 415

Gly Ala Glu Glu Asn Tyr Lys Ala Glu Tyr Arg Arg Leu Lys Gln Met
420 425 430

Lys Glu Met Gly Val Asn Ser Ile Arg Thr Thr His Asn Pro Ala Ser
435 440 445

Glu Gln Thr Leu Gln Ile Ala Ala Glu Leu Gly Leu Leu Val Gln Glu
450 455 460

Glu Ala Phe Asp Thr Trp Tyr Gly Gly Lys Lys Pro Tyr Asp Tyr Gly
465 470 475 480

Arg Phe Phe Glu Lys Asp Ala Thr His Pro Glu Ala Arg Lys Gly Glu
485 490 495

Lys Trp Ser Asp Phe Asp Leu Arg Thr Met Val Glu Arg Gly Lys Asn
500 505 510

Asn Pro Ala Ile Phe Met Trp Ser Ile Gly Asn Glu Ile Gly Glu Ala
515 520 525

Asn Gly Asp Ala His Ser Leu Ala Thr Val Lys Arg Leu Val Lys Val
530 535 540

Ile Lys Asp Val Asp Lys Thr Arg Tyr Val Thr Met Gly Ala Asp Lys
545 550 555 560

Phe Arg Phe Gly Asn Gly Ser Gly Gly His Glu Lys Ile Ala Asp Glu
565 570 575

Leu Asp Ala Val Gly Phe Asn Tyr Ser Glu Asp Asn Tyr Lys Ala Leu
580 585 590

Arg Ala Lys His Pro Lys Trp Leu Ile Tyr Gly Ser Glu Thr Ser Ser
595 600 605

Ala Thr Arg Thr Arg Gly Ser Tyr Tyr Arg Pro Glu Arg Glu Leu Lys
610 615 620

His Ser Asn Gly Pro Glu Arg Asn Tyr Glu Gln Ser Asp Tyr Gly Asn
625 630 635 640

Asp Arg Val Gly Trp Gly Lys Thr Ala Thr Ala Ser Trp Thr Phe Asp
645 650 655

Arg Asp Asn Ala Gly Tyr Ala Gly Gln Phe Ile Trp Thr Gly Thr Asp
660 665 670

Tyr Ile Gly Glu Pro Thr Pro Trp His Asn Gln Asn Gln Thr Pro Val
675 680 685

Lys Ser Ser Tyr Phe Gly Ile Val Asp Thr Ala Gly Ile Pro Lys His
690 695 700

Asp Phe Tyr Leu Tyr Gln Ser
705 710

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

TTACTTTGGT ATCGTAGATA CAGCCGGCAT TCCAAAACAT GACTTCTATC TCTACCAAAG 60

CCAATGGGTT TCTGTTAAGA AGAAACCGAT GGTACACCTT CTTCCTCACT GGAACTGGGA 120

AAACAAAGAA TTAGCATCCA AAGTAGCTGA CTCAGAAGGT AAGATTCCAG TTCGTGCTTA 180

TTCGAATGCT TCTAGTGTAG AATTGTTCTT GAATGGAAAA TCTCTTGGTC TTAAGACTTT 240

CAATAAAAAA CAAACCAGCG ATGGGCGGAC TTACCAAGAA GGTGCAAATG CTAATGAACT 300

TTATCTTGAA TGGAAAGTTG CCTATCAACC AGGTACCTTG GAAGCAATTG CTCGTGATGA 360

ATCTGGCAAG GAAATTGCTC GAGATAAGAT TACGACTGCT GGTAAGCCAG CGGCAGTTCG 420

TCTTATTAAG GAAGACCATG CGATTGCAGC AGATGGAAAA GACTTGACTT ACATCTACTA 480

TGAAATTGTT GACAGCCAGG GGAATGTGGT TCCAACTGCT AATAATCTGG TTCGCTTCCA 540

ATTGCATGGC CAAGGTCAAC TGGTCGGTGT AGATAACGGA GAACAAGCCA GCCGTGAACG 600

CTATAAGGCG CAAGCAGATG GTTCTTGGAT TCGTAAAGCA TTTAATGGTA AAGGTGTTGC 660

CATTGTCAAA TCAACTGAAC AAGCAGGGAA ATTCACCCTG ACTGCCCACT CTGATCTCTT 720

GAAATCGAAC CAAGTCACTG TCTTTACTGG TAAGAAAGAA GGACAAGAGA AGACTGTTTT 780

GGGGACAGAA GTGCCAAAAG TACAGACCAT TATTGGAGAG GCACCTGAAA TGCCTACCAC 840

TGTTCCGTTT GTATACAGTG ATGGTAGCCG TGCAGAACGT CCTGTAACCT GGTCTTCAGT 900

AGATGTGAGC AAGCCTGGTA TTGTAACGGT GAAAGGTATG GCTGACGGAC GAGAAGTAGA 960

AGCTCGTGTA GAAGTGATTG CTCTTAAATC AGAGCTACCA GTTGTGAAAC GTATTGCTCC 1020

AAATACTGAC TTGAATTCTG TAGACAAATC TGTTTCCTAT GTTTTGATTG ATGGAAGTGT 1080

TGAAGAGTAT GAAGTGGACA AGTGGGAGAT TGCCGAAGAA GATAAAGCTA AGTTAGCAAT 1140

TCCAGGTTCT CGTATTCAAG CGACCGGTTA TTTAGAAGGT CAACCAATTC ATGCAACCCT 1200

TGTGGTAGAA GAAGGCAATC CTGCGGCACC TGCAGTACCA ACTGTAACGG TTGGTGGTGA 1260

GGCAGTAACA GGTCTTACTA GTCAAAAACC AATGCAATAC CGCACTCTTG CTTATGGAGC 1320

TAAGTTGCCA GAAGTCACAG CAAGTGCTAA AAATGCAGCT GTTACAGTTC TTCAAGCAAG 1380

CGCAGCAAAC GGCATGCGTG CGAGCATCTT TATTCAGCCT AAAGATGGTG GCCCTCTTCA 1440

AACCTATGCA ATTCAATTCC TTGAAGAAGC GCCAAAAATT GCTCACTTGA GCTTGCAAGT 1500

GGAAAAAGCT GACAGTCTCA AAGAAGACCA AACTGTCAAA TTGTCGGTTC GAGCTCACTA 1560

TCAAGATGGA ACGCAAGCTG TATTACCAGC TGATAAAGTA ACCTTCTCTA CAAGTGGTGA 1620

AGGGGAAGTC GCAATTCGTA AAGGAATGCT TGAGTTGCAT AAGCCAGGAG CAGTCACTCT 1680

GAACGCTGAA TATGAGGGAG CTAAAGACCA AGTTGAACTC ACTATCCAAG CCAATACTGA 1740

GAAGAAGATT GCGCAATCCA TCCGTCCTGT AAATGTAGTG ACAGATTTGC ATCAGGAACC 1800

AAGTCTTCCA GCAACAGTAA CAGTTGAGTA TGACAAAGGT TTCCCTAAAA CTCATAAAGT 1860

CACTTGGCAA GCTATTCCGA AAGAAAAACT AGACTCCTAT CAAACATTTG AAGTACTAGG 1920

TAAAGTTGAA GGAATTGACC TTGAAGCGCG TGCAAAAGTC TCTGTAGAAG GTATCGTTTC 1980

AGTTGAAGAA GTCAGTGTGA CAACTCCAAT CGCAGAAGCA CCACAATTAC CAGAAAGTGT 2040

TCGGACATAT GATTCAAATG GTCACGTTTC ATCAGCTAAG GTTGCATGGG ATGCGATTCG 2100

TCCAGAGCAA TACGCTAAGG AAGGTGTCTT TACAGTTAAT GGTCGCTTAG AAGGTACGCA 2160

ATTAACA 2167


(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 722 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

Tyr Phe Gly Ile Val Asp Thr Ala Gly Ile Pro Lys His Asp Phe Tyr
1 5 10 15

Leu Tyr Gln Ser Gln Trp Val Ser Val Lys Lys Lys Pro Met Val His
20 25 30

Leu Leu Pro His Trp Asn Trp Glu Asn Lys Glu Leu Ala Ser Lys Val
35 40 45

Ala Asp Ser Glu Gly Lys Ile Pro Val Arg Ala Tyr Ser Asn Ala Ser
50 55 60

Ser Val Glu Leu Phe Leu Asn Gly Lys Ser Leu Gly Leu Lys Thr Phe
65 70 75 80

Asn Lys Lys Gln Thr Ser Asp Gly Arg Thr Tyr Gln Glu Gly Ala Asn
85 90 95

Ala Asn Glu Leu Tyr Leu Glu Trp Lys Val Ala Tyr Gln Pro Gly Thr
100 105 110

Leu Glu Ala Ile Ala Arg Asp Glu Ser Gly Lys Glu Ile Ala Arg Asp
115 120 125

Lys Ile Thr Thr Ala Gly Lys Pro Ala Ala Val Arg Leu Ile Lys Glu
130 135 140

Asp His Ala Ile Ala Ala Asp Gly Lys Asp Leu Thr Tyr Ile Tyr Tyr
145 150 155 160

Glu Ile Val Asp Ser Gln Gly Asn Val Val Pro Thr Ala Asn Asn Leu
165 170 175

Val Arg Phe Gln Leu His Gly Gln Gly Gln Leu Val Gly Val Asp Asn
180 185 190

Gly Glu Gln Ala Ser Arg Glu Arg Tyr Lys Ala Gln Ala Asp Gly Ser
195 200 205

Trp Ile Arg Lys Ala Phe Asn Gly Lys Gly Val Ala Ile Val Lys Ser
210 215 220

Thr Glu Gln Ala Gly Lys Phe Thr Leu Thr Ala His Ser Asp Leu Leu
225 230 235 240

Lys Ser Asn Gln Val Thr Val Phe Thr Gly Lys Lys Glu Gly Gln Glu
245 250 255

Lys Thr Val Leu Gly Thr Glu Val Pro Lys Val Gln Thr Ile Ile Gly
260 265 270

Glu Ala Pro Glu Met Pro Thr Thr Val Pro Phe Val Tyr Ser Asp Gly
275 280 285

Ser Arg Ala Glu Arg Pro Val Thr Trp Ser Ser Val Asp Val Ser Lys
290 295 300

Pro Gly Ile Val Thr Val Lys Gly Met Ala Asp Gly Arg Glu Val Glu
305 310 315 320

Ala Arg Val Glu Val Ile Ala Leu Lys Ser Glu Leu Pro Val Val Lys
325 330 335

Arg Ile Ala Pro Asn Thr Asp Leu Asn Ser Val Asp Lys Ser Val Ser
340 345 350

Tyr Val Leu Ile Asp Gly Ser Val Glu Glu Tyr Glu Val Asp Lys Trp
355 360 365

Glu Ile Ala Glu Glu Asp Lys Ala Lys Leu Ala Ile Pro Gly Ser Arg
370 375 380

Ile Gln Ala Thr Gly Tyr Leu Glu Gly Gln Pro Ile His Ala Thr Leu
385 390 395 400

Val Val Glu Glu Gly Asn Pro Ala Ala Pro Ala Val Pro Thr Val Thr
405 410 415

Val Gly Gly Glu Ala Val Thr Gly Leu Thr Ser Gln Lys Pro Met Gln
420 425 430

Tyr Arg Thr Leu Ala Tyr Gly Ala Lys Leu Pro Glu Val Thr Ala Ser
435 440 445

Ala Lys Asn Ala Ala Val Thr Val Leu Gln Ala Ser Ala Ala Asn Gly
450 455 460

Met Arg Ala Ser Ile Phe Ile Gln Pro Lys Asp Gly Gly Pro Leu Gln
465 470 475 480

Thr Tyr Ala Ile Gln Phe Leu Glu Glu Ala Pro Lys Ile Ala His Leu
485 490 495

Ser Leu Gln Val Glu Lys Ala Asp Ser Leu Lys Glu Asp Gln Thr Val
500 505 510

Lys Leu Ser Val Arg Ala His Tyr Gln Asp Gly Thr Gln Ala Val Leu
515 520 525

Pro Ala Asp Lys Val Thr Phe Ser Thr Ser Gly Glu Gly Glu Val Ala
530 535 540

Ile Arg Lys Gly Met Leu Glu Leu His Lys Pro Gly Ala Val Thr Leu
545 550 555 560

Asn Ala Glu Tyr Glu Gly Ala Lys Asp Gln Val Glu Leu Thr Ile Gln
565 570 575

Ala Asn Thr Glu Lys Lys Ile Ala Gln Ser Ile Arg Pro Val Asn Val
580 585 590

Val Thr Asp Leu His Gln Glu Pro Ser Leu Pro Ala Thr Val Thr Val
595 600 605

Glu Tyr Asp Lys Gly Phe Pro Lys Thr His Lys Val Thr Trp Gln Ala
610 615 620

Ile Pro Lys Glu Lys Leu Asp Ser Tyr Gln Thr Phe Glu Val Leu Gly
625 630 635 640

Lys Val Glu Gly Ile Asp Leu Glu Ala Arg Ala Lys Val Ser Val Glu
645 650 655

Gly Ile Val Ser Val Glu Glu Val Ser Val Thr Thr Pro Ile Ala Glu
660 665 670

Ala Pro Gln Leu Pro Glu Ser Val Arg Thr Tyr Asp Ser Asn Gly His
675 680 685

Val Ser Ser Ala Lys Val Ala Trp Asp Ala Ile Arg Pro Glu Gln Tyr
690 695 700

Ala Lys Glu Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gln
705 710 715 720

Leu Thr

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2329 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

AGCTAAGGTT GCATGGGATG CGATTCGTCC AGAGCAATAC GCTAAGGAAG GTGTCTTTAC 60

AGTTAATGGT CGCTTAGAAG GTACGCAATT AACAACTAAA CTTCATGTTC GCGTATCTGC 120

TCAAACTGAG CAAGGTGCAA ACATTTCTGA CCAATGGACC GGTTCAGAAT TGCCACTTGC 180

CTTTGCTTCA GACTCAAATC CAAGCGACCC AGTTTCAAAT GTTAATGACA AGCTCATTTC 240

CTACAATAAC CAACCAGCCA ATCGTTGGAC AAACTGGAAT CGTACTAATC CAGAAGCTTC 300

AGTCGGTGTT CTGTTTGGAG ATTCAGGTAT CTTGAGCAAA CGCTCCGTTG ATAATCTAAG 360

TGTCGGATTC CATGAAGACC ATGGAGTTGG TGTACCGAAG TCTTATGTGA TTGAGTATTA 420

TGTTGGTAAG ACTGTCCCAA CAGCTCCTAA AAACCCTAGT TTTGTTGGTA ATGAGGACCA 480

TGTCTTTAAT GATTCTGCCA ACTGGAAACC AGTTACTAAT CTAAAAGCCC CTGCTCAACT 540

CAAGGCTGGA GAAATGAACC ACTTTAGCTT TGATAAAGTT GAAACCTATG CTGTTCGTAT 600

TCGCATGGTT AAAGCAGATA ACAAGCGTGG AACGTCTATC ACAGAGGTAC AAATCTTTGC 660

GAAACAAGTT GCGGCAGCCA AGCAAGGACA AACAAGAATC CAAGTTGACG GCAAAGACTT 720

AGCAAACTTC AACCCTGATT TGACAGACTA CTACCTTGAG TCTGTAGATG GAAAAGTTCC 780

GGCAGTCACA GCAAGTGTTA GCAACAATGG TCTCGCTACC GTCGTTCCAA GCGTTCGTGA 840

AGGTGAGCCA GTTCGTGTCA TCGCGAAAGC TGAAAATGGC GACATCTTAG GAGAATACCG 900

TCTGCACTTC ACTAAGGATA AGAGCTTACT TTCTCATAAA CCAGTTGCTG CGGTTAAACA 960

AGCTCGCTTG CTACAAGTAG GTCAAGCACT TGAATTGCCG ACTAAGGTTC CAGTTTACTT 1020

CACAGGTAAA GACGGCTACG AAACAAAAGA CCTGACAGTT GAATGGGAAG AAGTTCCAGC 1080

GGAAAATCTG ACAAAAGCAG GTCAATTTAC [illegible] [illegible] [illegible] 1140

TGTTGCTGAG ATCACTGTAC GAGTGACAGA [illegible] [illegible] CAGATAACCC 1200

TAACTATGAT GAAAACAGTA ACCAGGCCTT [illegible] [illegible] [illegible] 1260

CTCTCATGAC CGCGTTGACT ATCTCAATGA CGGAGATCAT [illegible] [illegible] 1320

AAACTGGTCA CCAACACCAT CTTCTAATCC AGAAGTATCA [illegible] [illegible] 1380

AAATGGTAAG ATTGTAGAAC GGACTGTTAC ACAAGGAAAA [illegible] [illegible] 1440

TGGTACGGAT GCACCATCTA AACTCGTTTT AGAACGCTAT [illegible] [illegible] 1500

GCCAACCTAC TATTCAAACT ACCAAGCCTA CGACGCAGAC [illegible] [illegible] 1560

AAATTGGGAA GCTGTTCCTT ATCGTGCGGA TAAAGACATT [illegible] [illegible] 1620

CGTAACATTT AAAGCTATCA AAGCCAAAGC TATGAGATGG [illegible] [illegible] 1680

TAAGAGCGGT GTTGCGATGA TTGAGATGAC CTTCCTTGCA [illegible] [illegible] 1740

AAGCACTCAA TCAAAGATTC TTGTAGATGG AAAAGAACTT [illegible] [illegible] 1800

TCAAGACTAT CAAATTACCT ATAAAGGTCA ACGGCCAAAA [illegible] [illegible] 1860

TCAAGTAGCT TCAACTGTGG TAGATAGTGG AGAAGATAGC [illegible] [illegible] 1920

CGTTTCAGAA AGTGGAAAAC AAGTCAAGGA ATACCGTATC CACTTGACTA AGGAAAAACC 1980

AGTTTCTGAG AAGACAGTTG CTGCTGTACA AGAAGATCTT CCAAAAATCG AATTTGTTGA 2040

AAAAGATTTG GCATACAAGA CAGTTGAGAA AAAAGATTCA ACACTGTATC TAGGTGAAAC 2100

TCGTGTAGAA CAAGAAGGAA AAGTTGGAAA AGAACGTATC TTTACAGCGA TTAATCCTGA 2160

TGGAAGTAAG GAAGAAAAAC TCCGTGAAGT GGTAGAAGTT CCGACAGACC GCATCGTCTT 2220

GGTTGGAACC AAACCAGTAG CTCAAGAAGC TAAAAAACCA CAAGTGTCAG AAAAAGCAGA 2280

TACAAAACCA ATTGATTCAA GTGAAGCTAG TCAAACTAAT AAAGCCCAG

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Ala Lys Val Ala Trp Asp Ala Ile Arg Pro Glu Gln Tyr Ala Lys Glu
1 5 10 15

Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gln Leu Thr Thr
20 25 30

Lys Leu His Val Arg Val Ser Ala Gln Thr Glu Gln Gly Ala Asn Ile
35 40 45

Ser Asp Gln Trp Thr Gly Ser Glu Leu Pro Leu Ala Phe Ala Ser Asp
50 55 60

Ser Asn Pro Ser Asp Pro Val Ser Asn Val Asn Asp Lys Leu Ile Ser
65 70 75 80

Tyr Asn Asn Gln Pro Ala Asn Arg Trp Thr Asn Trp Asn Arg Thr Asn
85 90 95

Pro Glu Ala Ser Val Gly Val Leu Phe Gly Asp Ser Gly Ile Leu Ser
100 105 110

Lys Arg Ser Val Asp Asn Leu Ser Val Gly Phe His Glu Asp His Gly
115 120 125

Val Gly Val Pro Lys Ser Tyr Val Ile Glu Tyr Tyr Val Gly Lys Thr
130 135 140

Val Pro Thr Ala Pro Lys Asn Pro Ser Phe Val Gly Asn Glu Asp His
145 150 155 160

Val Phe Asn Asp Ser Ala Asn Trp Lys Pro Val Thr Asn Leu Lys Ala
165 170 175

Pro Ala Gln Leu Lys Ala Gly Glu Met Asn His Phe Ser Phe Asp Lys
180 185 190

Val Glu Thr Tyr Ala Val Arg Ile Arg Met Val Lys Ala Asp Asn Lys
195 200 205

Arg Gly Thr Ser Ile Thr Glu Val Gln Ile Phe Ala Lys Gln Val Ala
210 215 220

Ala Ala Lys Gln Gly Gln Thr Arg Ile Gln Val Asp Gly Lys Asp Leu
225 230 235 240

Ala Asn Phe Asn Pro Asp Leu Thr Asp Tyr Tyr Leu Glu Ser Val Asp
245 250 255

Gly Lys Val Pro Ala Val Thr Ala Ser Val Ser Asn Asn Gly Leu Ala
260 265 270

Thr Val Val Pro Ser Val Arg Glu Gly Glu Pro Val Arg Val Ile Ala
275 280 285

Lys Ala Glu Asn Gly Asp Ile Leu Gly Glu Tyr Arg Leu His Phe Thr
290 295 300

Lys Asp Lys Ser Leu Leu Ser His Lys Pro Val Ala Ala Val Lys Gln
305 310 315 320

Ala Arg Leu Leu Gln Val Gly Gln Ala Leu Glu Leu Pro Thr Lys Val
325 330 335

Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Glu Thr Lys Asp Leu Thr
340 345 350

Val Glu Trp Glu Glu Val Pro Ala Glu Asn Leu Thr Lys Ala Gly Gln
355 360 365

Phe Thr Val Arg Gly Arg Val Leu Gly Ser Asn Leu Val Ala Glu Ile
370 375 380

Thr Val Arg Val Thr Asp Lys Leu Gly Glu Thr Leu Ser Asp Asn Pro
385 390 395 400

Asn Tyr Asp Glu Asn Ser Asn Gln Ala Phe Ala Ser Ala Thr Asn Asp
405 410 415

Ile Asp Lys Asn Ser His Asp Arg Val Asp Tyr Leu Asn Asp Gly Asp
420 425 430

His Ser Glu Asn Arg Arg Trp Thr Asn Trp Ser Pro Thr Pro Ser Ser
435 440 445

Asn Pro Glu Val Ser Ala Gly Val Ile Phe Arg Glu Asn Gly Lys Ile
450 455 460

Val Glu Arg Thr Val Thr Gln Gly Lys Val Gln Phe Phe Ala Asp Ser
465 470 475 480

Gly Thr Asp Ala Pro Ser Lys Leu Val Leu Glu Arg Tyr Val Gly Pro
485 490 495

Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Tyr Gln Ala Tyr Asp Ala
500 505 510

Asp His Pro Phe Asn Asn Pro Glu Asn Trp Glu Ala Val Pro Tyr Arg
515 520 525

Ala Asp Lys Asp Ile Ala Ala Gly Asp Glu Ile Asn Val Thr Phe Lys
530 535 540

Ala Ile Lys Ala Lys Ala Met Arg Trp Arg Met Glu Arg Lys Ala Asp
545 550 555 560

Lys Ser Gly Val Ala Met Ile Glu Met Thr Phe Leu Ala Pro Ser Glu
565 570 575

Leu Pro Gln Glu Ser Thr Gln Ser Lys Ile Leu Val Asp Gly Lys Glu
580 585 590

Leu Ala Asp Phe Ala Glu Asn Arg Gln Asp Tyr Gln Ile Thr Tyr Lys
595 600 605

Gly Gln Arg Pro Lys Val Ser Val Glu Glu Asn Asn Gln Val Ala Ser
610 615 620

Thr Val Val Asp Ser Gly Glu Asp Ser Phe Pro Val Leu Val Arg Leu
625 630 635 640

Val Ser Glu Ser Gly Lys Gln Val Lys Glu Tyr Arg Ile His Leu Thr
645 650 655

Lys Glu Lys Pro Val Ser Glu Lys Thr Val Ala Ala Val Gln Glu Asp
660 665 670

Leu Pro Lys Ile Glu Phe Val Glu Lys Asp Leu Ala Tyr Lys Thr Val
675 680 685

Glu Lys Lys Asp Ser Thr Leu Tyr Leu Gly Glu Thr Arg Val Glu Gln
690 695 700

Glu Gly Lys Val Gly Lys Glu Arg Ile Phe Thr Ala Ile Asn Pro Asp
705 710 715 720

Gly Ser Lys Glu Glu Lys Leu Arg Glu Val Val Glu Val Pro Thr Asp
725 730 735

Arg Ile Val Leu Val Gly Thr Lys Pro Val Ala Gln Glu Ala Lys Lys
740 745 750

Pro Gln Val Ser Glu Lys Ala Asp Thr Lys Pro Ile Asp Ser Ser Glu
755 760 765

Ala Ser Gln Thr Asn Lys Ala Gln
770 775

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CTATCACTAT GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TTCAGACAGG 60

AAAGCCTGAC AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AGTTGCCTCA 120

AACAGGGACA GAA

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Tyr His Tyr Val Asn Lys Glu Ile Ile Ser Gln Glu Ala Lys Asp Leu
1 5 10 15

Ile Gln Thr Gly Lys Pro Asp Arg Asn Glu Val Val Tyr Gly Leu Val
20 25 30

Tyr Gln Lys Asp Gln Leu Pro Gln Thr Gly Thr Glu
35 40

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

TGAGACTCCT CAATCAATAA CAAATCAGGA GCAAGCTAGG ACAGAAAACC AAGTAGTAGA 60

GACAGAGGAA GCTCCAAAAG AAGAAGCACC TAAAACAGAA GAAAGTCCAA AGGAAGAACC 120

AAAATCGGAG GTAAAACCTA CTGACGACAC CCTTCCTAAA GTAGAAGAGG GGAAAGAAGA 180

TTCAGCAGAA CCAGCTCCAG TTGAAGAAGT AGGTGGAGAA GTTGAGTCAA AACCAGAGGA 240

AAAAGTAGCA GTTAAGCCAG AAAGTCAACC ATCAGACAAA CCAGCTGAGG AATCAAAAGT 300

TGAACAAGCA GGTGAACCAG TCGCGCCAAG AGAAGACGAA AAGGCACCAG TCGAGCCAGA 360

AAAGCAACCA GAAGCTCCTG AAGAAGAGAA GGCTGTAGAG GAAACACCGA AACAAGAAGA 420

GTCAACTCCA GATACCAAGG CTGAAGAAAC TGTAGAACCA AAAGAGGAGA CTGTTAATCA 480

ATCTATTGAA CAACCAAAAG TTGAAACGCC TGCTGTAGAA AAACAAACAG AACCAACAGA 540

GGAACCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AACAGGCACC 600

AACGGCACCA GTTGAGCCAG AAAAGCAACC AGAAGTTCCT GAAGAAGAGA AGGCTGTAGA 660

GGAAACACCG AAACCAGAAG ATAAAATAAA GGGTATTGGT ACTAAAGAAC CAGTTGATAA 720

AAGTGAGTTA AATAATCAAA TTGATAAAGC TAGTTCAGTT TCTCCTACTG ATTAT 775

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 258 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

Glu Thr Pro Gln Ser Ile Thr Asn Gln Glu Gln Ala Arg Thr Glu Asn
1 5 10 15

Gln Val Val Glu Thr Glu Glu Ala Pro Lys Glu Glu Ala Pro Lys Thr
20 25 30

Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Glu Val Lys Pro Thr Asp
35 40 45

Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp Ser Ala Glu Pro
50 55 60

Ala Pro Val Glu Glu Val Gly Gly Glu Val Glu Ser Lys Pro Glu Glu
65 70 75 80

Lys Val Ala Val Lys Pro Glu Ser Gln Pro Ser Asp Lys Pro Ala Glu
85 90 95

Glu Ser Lys Val Glu Gln Ala Gly Glu Pro Val Ala Pro Arg Glu Asp
100 105 110

Glu Lys Ala Pro Val Glu Pro Glu Lys Gln Pro Glu Ala Pro Glu Glu
115 120 125

Glu Lys Ala Val Glu Glu Thr Pro Lys Gln Glu Glu Ser Thr Pro Asp
130 135 140

Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu Thr Val Asn Gln
145 150 155 160

Ser Ile Glu Gln Pro Lys Val Glu Thr Pro Ala Val Glu Lys Gln Thr
165 170 175

Glu Pro Thr Glu Glu Pro Lys Val Glu Gln Ala Gly Glu Pro Val Ala
180 185 190

Pro Arg Glu Asp Glu Gln Ala Pro Thr Ala Pro Val Glu Pro Glu Lys
195 200 205

Gln Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys
210 215 220

Pro Glu Asp Lys Ile Lys Gly Ile Gly Thr Lys Glu Pro Val Asp Lys
225 230 235 240

Ser Glu Leu Asn Asn Gln Ile Asp Lys Ala Ser Ser Val Ser Pro Thr
245 250 255

Asp Tyr

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

GGATGCTCAA GAAACTGCGG GAGTTCACTA TAAATATGTG GCAGATTCAG AGCTATCATC 60

AGAAGAAAAG AAGCAGCTTG TCTATGATAT TCCGACATAC GTGGAGAATG ATGATGAAAC 120

TTATTATCTT GTTTATAAGT TAAATTCTCA AAATCAACTG GCGGAATTGC CAAATACTGG 180

AAGCAAGAAT GAGAGGCAA 199

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 66 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Asp Ala Gln Glu Thr Ala Gly Val His Tyr Lys Tyr Val Ala Asp Ser
1 5 10 15

Glu Leu Ser Ser Glu Glu Lys Lys Gln Leu Val Tyr Asp Ile Pro Thr
20 25 30

Tyr Val Glu Asn Asp Asp Glu Thr Tyr Tyr Leu Val Tyr Lys Leu Asn
35 40 45

Ser Gln Asn Gln Leu Ala Glu Leu Pro Asn Thr Gly Ser Lys Asn Glu
50 55 60

Arg Gln
65

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CGACAAAGGT GAGACTGAGG TTCAACCAGA GTCGCCAGAT ACTGTGGTAA GTGATAAAGG 60

TGAACCAGAG CAGGTAGCAC CGCTTCCAGA ATATAAGGGT AATATTGAGC AAGTAAAACC 120

TGAAACTCCG GTTGAGAAGA CCAAAGAACA AGGTCCAGAA AAAACTGAAG AAGTTCCAGT 180

AAAACCAACA GAAGAAACAC CAGTAAATCC AAATGAAGGT ACTACAGAAG GAACCTCAAT 240

TCAAGAAGCA GAAAATCCAG TTCAACCTGC AGAAGAATCA ACAACGAATT CAGAGAAAGT 300

ATCACCAGAT ACATCTAGCA AAAATACTGG GGAAGTGTCC AGTAATCCTA GTGATTCGAC 360

AACCTCAGTT GGAGAATCAA ATAAACCAGA ACATAATGAC TCTAAAAATG AAAATTCAGA 420

AAAAACTGTA GAAGAAGTTC CAGTAAATCC AAATGAAGGC ACAGTAGAAG GTACCTCAAA 480

TCAAGAAACA GAAAAACCAG TTCAACCTGC AGAAGAAACA CAAACAAACT CTGGGAAAAT 540

AGCTAACGAA AATACTGGAG AAGTATCCAA TAAACCTAGT GATTCAAAAC CACCAGTTGA 600

AGAATCAAAT CAACCAGAAA AAAACGGAAC TGCAACAAAA CCAGAAAATT CAGGTAATAC 660

AACATCAGAG AATGGACAAA CAGAACCAGA ACCATCAAAC GGAAATTCAA CTGAGGATGT 720

TTCAACCGAA TCAAACACAT CCAATTCAAA TGGAAACGAA GAAATTAAAC AAGAAAATGA 780

ACTAGACCCT GATAAAAAGG TAGAAGAACC AGAGAAAACA CTTGAATTAA GAAAT 835

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 278 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Asp Lys Gly Glu Thr Glu Val Gln Pro Glu Ser Pro Asp Thr Val Val
1 5 10 15

Ser Asp Lys Gly Glu Pro Glu Gln Val Ala Pro Leu Pro Glu Tyr Lys
20 25 30

Gly Asn Ile Glu Gln Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys
35 40 45

Glu Gln Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu
50 55 60

Glu Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Glu Gly Thr Ser Ile
65 70 75 80

Gln Glu Ala Glu Asn Pro Val Gln Pro Ala Glu Glu Ser Thr Thr Asn
85 90 95

Ser Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val
100 105 110

Ser Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asn Lys
115 120 125

Pro Glu His Asn Asp Ser Lys Asn Glu Asn Ser Glu Lys Thr Val Glu
130 135 140

Glu Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn
145 150 155 160

Gln Glu Thr Glu Lys Pro Val Gln Pro Ala Glu Glu Thr Gln Thr Asn
165 170 175

Ser Gly Lys Ile Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro
180 185 190

Ser Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gln Pro Glu Lys Asn
195 200 205

Gly Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Thr Thr Ser Glu Asn
210 215 220

Gly Gln Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val
225 230 235 240

Ser Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu Ile Lys
245 250 255

Gln Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys
260 265 270

Thr Leu Glu Leu Arg Asn
275

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC ACTAAACTGA CTGCTGAAAA 60

ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA GATATAACTC ATTTAACAGA 120

TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT TCAGCATTAG ACGGAGCGAC 180

AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC CCAGATGGTT CAGTAGTGAC 240

GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT GAATCTGTAA CTCAAGAAGC 300

TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT AAGGGAGGCA ATACTGGAAG 360

CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG GGTGGATCAG CTCACACAGG 420

TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA GCTACTGAAA AAGAATCAGC 480

TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT GAAATCAAAG GCGCACCGCT 540

TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA GCAGAAAAAC AAGCAGCTCT 600

CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG GAAGCAGAAA CGATTGGAGT 660

GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG GCTCCTAAT 709

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 236 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

Asn Gln Leu Val Ala Gln Asp Pro Lys Ala Gln Asp Ser Thr Lys Leu
1 5 10 15

Thr Ala Glu Lys Ser Thr Val Lys Ala Pro Ala Gln Arg Val Asp Val
20 25 30

Lys Asp Ile Thr His Leu Thr Asp Glu Glu Lys Val Lys Val Ala Ile
35 40 45

Leu Gln Ala Asn Gly Ser Ala Leu Asp Gly Ala Thr Ile Asn Val Ala
50 55 60

Gly Asp Gly Thr Ala Thr Ile Thr Phe Pro Asp Gly Ser Val Val Thr
65 70 75 80

Ile Leu Gly Lys Asp Thr Val Gln Gln Ser Ala Lys Gly Glu Ser Val
85 90 95

Thr Gln Glu Ala Thr Pro Glu Tyr Lys Leu Glu Asn Thr Pro Gly Gly
100 105 110

Asp Lys Gly Gly Asn Thr Gly Ser Ser Asp Ala Asn Ala Asn Glu Gly
115 120 125

Gly Gly Ser Gln Ala Gly Gly Ser Ala His Thr Gly Ser Gln Asn Ser
130 135 140

Ala Gln Ser Gln Ala Ser Lys Gln Leu Ala Thr Glu Lys Glu Ser Ala
145 150 155 160

Lys Asn Ala Ile Glu Lys Ala Ala Lys Asp Lys Gln Asp Glu Ile Lys
165 170 175

Gly Ala Pro Leu Ser Asp Lys Glu Lys Ala Glu Leu Leu Ala Arg Val
180 185 190

Glu Ala Glu Lys Gln Ala Ala Leu Lys Glu Ile Glu Asn Ala Lys Thr
195 200 205

Met Glu Asp Val Lys Glu Ala Glu Thr Ile Gly Val Gln Ala Ile Ala
210 215 220

Met Val Thr Val Pro Lys Arg Pro Val Ala Pro Asn
225 230 235

(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 787 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

CAAACAGTCA GCTTCAGGAA CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG 60

GGGTGCCTTC ACAGAAATCA CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA 120

CACTGCCAAA ACAGCTGTGA TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG 180

GAATGCTAAT GCTATCGGCT ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT 240

AGAGATTGAT GGTGTCAAGG CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA 300

ACGTCCCTTC AACATTGTTT GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG 360

CTTTATCCAC TCCAAACAAG GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA 420

AACCGAAACC ACGGAATATA CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC 480

CACTTCAGTA TCTTCTTTAA TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA 540

AGTTACGATT GATATTACCT CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA 600

AACCGCTGAT ATTGGTATGG TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC 660

CCATGATGCT ATTGCTTTAG ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG 720

CCAAGTCAGT ATGGCTGAAC TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA 780

GATTAAA 787

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Lys Gln Ser Ala Ser Gly Thr Ile Glu Val Ile Ser Arg Glu Asn Gly
1 5 10 15

Ser Gly Thr Arg Gly Ala Phe Thr Glu Ile Thr Gly Ile Leu Lys Lys
20 25 30

Asp Gly Asp Lys Lys Ile Asp Asn Thr Ala Lys Thr Ala Val Ile Gln
35 40 45

Asn Ser Thr Glu Gly Val Leu Ser Ala Val Gln Gly Asn Ala Asn Ala
50 55 60

Ile Gly Tyr Ile Ser Leu Gly Ser Leu Thr Lys Ser Val Lys Ala Leu
65 70 75 80

Glu Ile Asp Gly Val Lys Ala Ser Arg Asp Thr Val Leu Asp Gly Glu
85 90 95

Tyr Pro Leu Gln Arg Pro Phe Asn Ile Val Trp Ser Ser Asn Leu Ser
100 105 110

Lys Leu Gly Gln Asp Phe Ile Ser Phe Ile His Ser Lys Gln Gly Gln
115 120 125

Gln Val Val Thr Asp Asn Lys Phe Ile Glu Ala Lys Thr Glu Thr Thr
130 135 140

Glu Tyr Thr Ser Gln His Leu Ser Gly Lys Leu Ser Val Val Gly Ser
145 150 155 160

Thr Ser Val Ser Ser Leu Met Glu Lys Leu Ala Glu Ala Tyr Lys Lys
165 170 175

Glu Asn Pro Glu Val Thr Ile Asp Ile Thr Ser Asn Gly Ser Ser Ala
180 185 190

Gly Ile Thr Ala Val Lys Glu Lys Thr Ala Asp Ile Gly Met Val Ser
195 200 205

Arg Glu Leu Thr Pro Glu Glu Gly Lys Ser Leu Thr His Asp Ala Ile
210 215 220

Ala Leu Asp Gly Ile Ala Val Val Val Asn Asn Asp Asn Lys Ala Ser
225 230 235 240

Gln Val Ser Met Ala Glu Leu Ala Asp Val Phe Ser Gly Lys Leu Thr
245 250 255

Thr Trp Asp Lys Ile Lys
260

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

ATTCGATGAT GCGGATGAAA AGATGACCCG TGATGAAATT GCCTATATGC TGACAAATAG 60

TGAAGAAACA TTGGATGCTG ATGAGATTGA GATGCTACAA GGTGTCTTTT CGCTCGATGA 120

ACTGATGGCA CGAGAGGTTA TGGTTCCTCG AACGGATGCC TTTATGGTGG ATATTCAGGA 180

TGATAGTCAA GCCATTATCC AAAGTATTTT AAAACAAAAT TATTCTCGTA TCCCGGTTTA 240

TGATGGGGAT AAGGACAATG TAATTGGAAT CATTCACACC AAGAGTCTCC TTAAGGCAGG 300

CTTTGTGGAC GGTTTTGACA ATATTGTTTG GAAGAGAATT TTACAAGATC CACTTTTTGT 360

ACCTGAAACT ATTTTTGTGG ATGACTTGCT AAAAGAACTG CGAAATACCC AAAGACAAAT 420 
G 

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 140 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

Phe Asp Asp Ala Asp Glu Lys Met Thr Arg Asp Glu Ile Ala Tyr Met
1 5 10 15

Leu Thr Asn Ser Glu Glu Thr Leu Asp Ala Asp Glu Ile Glu Met Leu
20 25 30

Gln Gly Val Phe Ser Leu Asp Glu Leu Met Ala Arg Glu Val Met Val
35 40 45

Pro Arg Thr Asp Ala Phe Met Val Asp Ile Gln Asp Asp Ser Gln Ala
50 55 60

Ile Ile Gln Ser Ile Leu Lys Gln Asn Tyr Ser Arg Ile Pro Val Tyr
65 70 75 80

Asp Gly Asp Lys Asp Asn Val Ile Gly Ile Ile His Thr Lys Ser Leu
85 90 95

Leu Lys Ala Gly Phe Val Asp Gly Phe Asp Asn Ile Val Trp Lys Arg
100 105 110

Ile Leu Gln Asp Pro Leu Phe Val Pro Glu Thr Ile Phe Val Asp Asp
115 120 125

Leu Leu Lys Glu Leu Arg Asn Thr Gln Arg Gln Met
130 135 140

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 331 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

GGAGAGTCGA TCAAAAGTAG ATGAAGC TGT GTCTAAGTTT GAAAAGG AC T CATCTTCTTC 6 0 

GTCAAGTTCA GACTCTTCCA CTAAACCGGA AGCTTCAGAT AC AGC GAAGC CAAACAAGCC 12 0 

GACAGAACCA GGAGAAAAGG TAGCAGAAGC TAAGAAGAAG GTTGAAGAAG CTGAGAAAAA 18 0 

AGCCAAGGAT CAAAAAGAAG AAGATCGTCG TAACTACCCA AC C ATTACTT AC AAAAC GCT 240 

TGAACTTGAA ATTGCTGAGT C C GATGTGGA AGTTAAAAAA GCGGAGCTTG AAC TAG T AAA 3 00 

AGTGAAAGCT AACGAACCTC GAG AC GAGC A A 33 1 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

Glu Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp
1 5 10 15

Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser
20 25 30

Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala
35 40 45

Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln
50 55 60

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu
65 70 75 80

Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu
85 90 95

Glu Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln
100 105 110

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

ATGGACAACA GGAAACTGGG ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC 60

AGATATTCCA ACAGTTGAAT CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC 120

GGTAAGGTAT GACCGTTTAT CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA 180

TCCAAGTGTT CCGACACCAA ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA 240

ACCAACTCCA GAACCAGGTA CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT 300

TCCGACTTAT GAGACAGGTA AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAAT 358

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

Trp Thr Thr Gly Asn Trp Asp Glu Val Ile Ser Gly Lys Ile Asp Lys
1 5 10 15

Tyr Lys Asp Pro Asp Ile Pro Thr Val Glu Ser Gln Glu Val Thr Ser
20 25 30

Asp Ser Ser Asp Lys Glu Ile Thr Val Arg Tyr Asp Arg Leu Ser Thr
35 40 45

Pro Glu Lys Pro Ile Pro Gln Pro Asn Pro Glu His Pro Ser Val Pro
50 55 60

Thr Pro Asn Pro Glu Leu Pro Asn Gln Glu Thr Pro Thr Pro Asp Lys
65 70 75 80

Pro Thr Pro Glu Pro Gly Thr Pro Lys Thr Glu Thr Pro Val Asn Pro
85 90 95

Asp Pro Glu Val Pro Thr Tyr Glu Thr Gly Lys Arg Glu Glu Leu Pro
100 105 110

Asn Thr Gly Thr Glu Ala Asn 
115 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

CGATGGGCTC AATCCAACCC [illegible] [illegible] [illegible] GAACGAAAGA 60

GGGTGACTTA TCAGAAAAAC [illegible] [illegible] [illegible] CTGAGGGCGT 120

TACTGGAAAT ACGAATTCAC TTCCGACACC [illegible] [illegible] [illegible] 180

AAGCCCTTCT AGTCTGGATA CACTTTTTGA [illegible] [illegible] [illegible] 240

GCTAACAGAT GTCTTAAAAG AAACTGTAGA [illegible] [illegible] [illegible] 300

TCCAGCAGAA ACTACTCCTG AACAAGTAAA AGGTGGAGTG [illegible] [illegible] 360

CATCGATGTT CCTGCTGCTT ATCTTGAAAA AGCTGAAGGG [illegible] [illegible] 420

TGTAAACCAA GTAATTCCTT ATGAACTATT CGCTGGTGAT [illegible] [illegible] 480

ACTAAAAGCT TCGGATAATG CTCCTTGGTC TGACAATGGT [illegible] [illegible] 540

ACCTCCTCTT GAAGGATTAA CAAAAGGGAA ATACTTCTAT [illegible] [illegible] 600

TACTGTTGGT AAACAAGGTC AAGCTTTAAT TGATCAACTT CGCGCTAATG [illegible] 660

TTATAAAGCT ACTGTTAAAG TTTACGGAAA TAAAGACGGT AAAGCTGACT [illegible] 720

AGTTGCTACT AAAAATGTAG ACATCAACAT CAATGGATTA GTTGCTAAAG [illegible] 780

AAAAGCCGTT GCAGACAACG TTAAAGACAG TATCGATGTT CCAGCAGCCT [illegible] 840

AGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGTCAACCAT GTGATTCCAT [illegible] 900

CGCAGGTGAT GGCATGTTGA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG [illegible] 960

AGATAACGGC GACGCTAAAA ACCCAGCCCT ATCTCCACTA GGCGAAAACG [illegible] 1020

AGGTCAATAC TTCTATCAAN TAGCCTTGGA CGGAAATGTA GCTGGCAAAG [illegible] 1080

GCTCATTGAC CAGTTCCGAG CAAANGGTAC TCAAACTTAC AGCGCTACAG [illegible] 1140

TGGTAACAAA GACGGTAAAC CAGACTTGGA CAACATCGTA GCAACTAAAA [illegible] 1200

TAACATAAAC GGTTTAATTT CTAAAGAAAC AGTTCAAAAA GCCGTTGCAG [illegible] 1260

NGACAGTATC GATGTTCCAG CAGCCTACCT AGAAAAAGCC AAGGGTGAAG [illegible] 1320

AGCAGGTGTC AACCATGTGA TTCCATACGA ACTCTTCGCA GGTGATGGTA [illegible] 1380

TCTCTTGCTC AAGGCATCTG ACAAGGCACC ATGGTCAGAT AACGGNGACG [illegible] 1440

AGCNCTATCT CCACTAGGTG AAAACGTGAA GACCAAAGGT CAATACTTCT ATCAANTAGC 1500

CTTGGACGGA AATGTAGCTG GCAAAGAAAA ACAAGCGCTC ATTGACCAGT TCCGAGCAAA 1560

CGGTACTCAA ACTTACAGCG CTACAGTCAA TGTCTATGGT AACAAAGACG GTAAACCAGA 1620

CTTGGACAAC ATCGTAGCAA CTAAAAAAGT CACTATTAAG ATAAATGTTA AAGAAACATC 1680

AGACACAGCA AATGGTTCAT TATCACCTTC TAACTCTGGT TCTGGCGTGA CTCCGATGAA 1740

TCACAATCAT GCTACAGGTA CTACAGATAG CATGCCTGCT GACACCATGA CAAGTTCTAC 1800

CAACACGATG GCAGGTGAAA ACATGGCTGC TTCTGCTAAC AAGATGTCTG ATACGATGAT 1860

GTCAGAGGAT AAAGCTATG

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

Asp Gly Leu Asn Pro Thr Pro Gly Gln Val Leu Pro Glu Glu Thr Ser
1 5 10 15

Gly Thr Lys Glu Gly Asp Leu Ser Glu Lys Pro Gly Asp Thr Val Leu
20 25 30

Thr Gln Ala Lys Pro Glu Gly Val Thr Gly Asn Thr Asn Ser Leu Pro
35 40 45

Thr Pro Thr Glu Arg Thr Glu Val Ser Glu Glu Thr Ser Pro Ser Ser
50 55 60

Leu Asp Thr Leu Phe Glu Lys Asp Glu Glu Ala Gln Lys Asn Pro Glu
65 70 75 80

Leu Thr Asp Val Leu Lys Glu Thr Val Asp Thr Ala Asp Val Asp Gly
85 90 95

Thr Gln Ala Ser Pro Ala Glu Thr Thr Pro Glu Gln Val Lys Gly Gly
100 105 110

Val Lys Glu Asn Thr Lys Asp Ser Ile Asp Val Pro Ala Ala Tyr Leu
115 120 125

Glu Lys Ala Glu Gly Lys Gly Pro Phe Thr Ala Gly Val Asn Gln Val
130 135 140

Ile Pro Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr Arg Leu Leu
145 150 155 160

Leu Lys Ala Ser Asp Asn Ala Pro Trp Ser Asp Asn Gly Thr Ala Lys
165 170 175

Asn Pro Ala Leu Pro Pro Leu Glu Gly Leu Thr Lys Gly Lys Tyr Phe
180 185 190

Tyr Glu Val Asp Leu Asn Gly Asn Thr Val Gly Lys Gln Gly Gln Ala
195 200 205

Leu Ile Asp Gln Leu Arg Ala Asn Gly Thr Gln Thr Tyr Lys Ala Thr
210 215 220

Val Lys Val Tyr Gly Asn Lys Asp Gly Lys Ala Asp Leu Thr Asn Leu
225 230 235 240

Val Ala Thr Lys Asn Val Asp Ile Asn Ile Asn Gly Leu Val Ala Lys
245 250 255

Glu Thr Val Gln Lys Ala Val Ala Asp Asn Val Lys Asp Ser Ile Asp
260 265 270

Val Pro Ala Ala Tyr Leu Glu Lys Ala Lys Gly Glu Gly Pro Phe Thr
275 280 285

Ala Gly Val Asn His Val Ile Pro Tyr Glu Leu Phe Ala Gly Asp Gly
290 295 300

Met Leu Thr Arg Leu Leu Leu Lys Ala Ser Asp Lys Ala Pro Trp Ser
305 310 315 320

Asp Asn Gly Asp Ala Lys Asn Pro Ala Leu Ser Pro Leu Gly Glu Asn
325 330 335

Val Lys Thr Lys Gly Gln Tyr Phe Tyr Gln Xaa Ala Leu Asp Gly Asn
340 345 350

Val Ala Gly Lys Glu Lys Gln Ala Leu Ile Asp Gln Phe Arg Ala Xaa
355 360 365

Gly Thr Gln Thr Tyr Ser Ala Thr Val Asn Val Tyr Gly Asn Lys Asp
370 375 380

Gly Lys Pro Asp Leu Asp Asn Ile Val Ala Thr Lys Lys Val Thr Ile
385 390 395 400

Asn Ile Asn Gly Leu Ile Ser Lys Glu Thr Val Gln Lys Ala Val Ala
405 410 415

Asp Asn Val Xaa Asp Ser Ile Asp Val Pro Ala Ala Tyr Leu Glu Lys
420 425 430

Ala Lys Gly Glu Gly Pro Phe Thr Ala Gly Val Asn His Val Ile Pro
435 440 445

Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr Arg Leu Leu Leu Lys
450 455 460

Ala Ser Asp Lys Ala Pro Trp Ser Asp Asn Gly Asp Ala Lys Asn Pro
465 470 475 480

Ala Leu Ser Pro Leu Gly Glu Asn Val Lys Thr Lys Gly Gln Tyr Phe
485 490 495

Tyr Gln Xaa Ala Leu Asp Gly Asn Val Ala Gly Lys Glu Lys Gln Ala
500 505 510

Leu Ile Asp Gln Phe Arg Ala Asn Gly Thr Gln Thr Tyr Ser Ala Thr
515 520 525

Val Asn Val Tyr Gly Asn Lys Asp Gly Lys Pro Asp Leu Asp Asn Ile
530 535 540

Val Ala Thr Lys Lys Val Thr Ile Lys Ile Asn Val Lys Glu Thr Ser
545 550 555 560

Asp Thr Ala Asn Gly Ser Leu Ser Pro Ser Asn Ser Gly Ser Gly Val
565 570 575

Thr Pro Met Asn His Asn His Ala Thr Gly Thr Thr Asp Ser Met Pro
580 585 590

Ala Asp Thr Met Thr Ser Ser Thr Asn Thr Met Ala Gly Glu Asn Met
595 600 605

Ala Ala Ser Ala Asn Lys Met Ser Asp Thr Met Met Ser Glu Asp Lys
610 615 620

Ala Met 
625 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

TTCCAATCAA AAACAGGCAG ATGGTAAACT CAATATCGTG ACAACCTTTT ACCCTGTCTA 60

TGArTTTACC AAGCAAGTCG CAGGAGATAC GGCTAATGTA GAACTCCTAA TCGGTGCTGG 120

GACAGAACCT CATGAATACG AACCATCTGC CAAGGCAGTT GCCAAAATCC AAGATGCAGA 180

TACCTTCGTT TATGAAAATG AAAACATGGA AACATGGGTA CCTAAATTGC TAGATACCTT 240

GGATAAGAAA AAAGTGAAAA CCATCAAGGC GACAGGCGAT ATGTTGCTCT TGCCAGGTGG 300

CGAGGAAGAA GAGGGAGACC ATGACCATGG AGAAGAAGGT CATCACCATG AGTTTGACCC 360

CCATGTTTGG TTATCACCAG TTCGTGCCAT tAAACTAGTA GAGCACCATC CGCGACACTT 420

GTCAGCAGAT TATCCTGATA AAAAAGAGAC CTTTGAGAAG AATGCAGCTG CCTATATCGA 480

AAAATTGCAA GCCTTGGATA AGGCTTACGC AGAAGGTTTG TCTCAAGCAA AACAAAAGAG 540

CTTTGTGACT CAACACGCAg CCTTTAACTa TCTTGCCTTG GACTATGGGA CTC 593

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 197 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

Ser Asn Gln Lys Gln Ala Asp Gly Lys Leu Asn Ile Val Thr Thr Phe
1 5 10 15

Tyr Pro Val Tyr Glu Phe Thr Lys Gln Val Ala Gly Asp Thr Ala Asn
20 25 30

Val Glu Leu Leu Ile Gly Ala Gly Thr Glu Pro His Glu Tyr Glu Pro
35 40 45

Ser Ala Lys Ala Val Ala Lys Ile Gln Asp Ala Asp Thr Phe Val Tyr
50 55 60

Glu Asn Glu Asn Met Glu Thr Trp Val Pro Lys Leu Leu Asp Thr Leu
65 70 75 80

Asp Lys Lys Lys Val Lys Thr Ile Lys Ala Thr Gly Asp Met Leu Leu
85 90 95

Leu Pro Gly Gly Glu Glu Glu Glu Gly Asp His Asp His Gly Glu Glu
100 105 110

Gly His His His Glu Phe Asp Pro His Val Trp Leu Ser Pro Val Arg
115 120 125

Ala Ile Lys Leu Val Glu His His Pro Arg His Leu Ser Ala Asp Tyr
130 135 140

Pro Asp Lys Lys Glu Thr Phe Glu Lys Asn Ala Ala Ala Tyr Ile Glu
145 150 155 160

Lys Leu Gln Ala Leu Asp Lys Ala Tyr Ala Glu Gly Leu Ser Gln Ala
165 170 175

Lys Gln Lys Ser Phe Val Thr Gln His Ala Ala Phe Asn Tyr Leu Ala
180 185 190 

Leu Asp Tyr Gly Thr 
195 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

TATCACAGGA TCGAACGGTA AGACAACCAC AACGACTATG ATTGGGGAAG TTTTGACTGC 60

TGCTGGCCAA CATGGTCTTT TATCAGGGAA TATCGGCTAT CCAGCTAGTC AGGTTGCTCA 120

AATAGCATCA GATAAGGACA CGCTTGTTAT GGAACTTTCT TCTTTCCAAC TCATGGGTGT 180

TCAAGAATTC CATCCAGAGA TTGCGGTTAT TACCAACCTC ATGCCAACTC ATATCGACTA 240

CCATGGGTCA TTTTCGGAAT ATGTAGCAGC CAAGTGGAAT ATCCAGAACA AGATGACAGC 300

AGCTGATTTC CTTGTCTTGA ACTTTAATCA AGACTTGGCA AAAGACTTGA CTTCCAAGAC 360

AGAAGCCACT GTTGTACCAT TTTCAACACT TGAAAAGGTT GATGGAGCTT ATCTGGAAGA 420

TGGTCAACTC TACTTCCGTG GTGAAGTAGT CATGGCAGCG AATGAAATCG GTGTTCCAGG 480

TAGCCACAAT GTGGAAAATG CCCTTGCGAC TATTGCTGTA GCCAAGCTTC GTGATGTGGA 540

CAATCAAACC ATCAAGGAAA CTCTTTCAGC CTTCGGTGGT GTCAAACACC GTCTCCAGTT 600

TGTGGATGAC ATCAAGGGTG TTAAATTCTA TAACGACAGT AAATCAACTA ATATCTTGGC 660

TACTCAAAAA GCCTTGTCAG GATTTGACAA CAGCAAGGTC GTCTTGATTG CAGGTGGTTT 720

GGACCGTGGC AATGAGTTTG ACGAATTGGT GCCAGACATT ACTGGACTCA AGAAGATGGT 780

CATCCTGGGT CAATCTGCAG AACGTGTCAA ACGGGCAGCA GACAAGGCTG GTGTCGCTTA 840

TGTGGAGGCG ACAGATATTG CAGATGCGAC CCGCAAGGCC TATGAGCTTG CGACTCAAGG 900

AGATGTGGTT CTTCTTAGTC CTGCCAATGC TAGCTGGGAT ATGTATGCTA ACTTTGAAGT 960

ACGTGGCGAC CTCTTTATCG ACACAGTAGC GGAGTTAAAA GAA 1003

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

Gly Ile Thr Gly Ser Asn Gly Lys Thr Thr Thr Thr Thr Met Ile Gly
1 5 10 15

Glu Val Leu Thr Ala Ala Gly Gln His Gly Leu Leu Ser Gly Asn Ile
20 25 30

Gly Tyr Pro Ala Ser Gln Val Ala Gln Ile Ala Ser Asp Lys Asp Thr
35 40 45

Leu Val Met Glu Leu Ser Ser Phe Gln Leu Met Gly Val Gln Glu Phe
50 55 60

His Pro Glu Ile Ala Val Ile Thr Asn Leu Met Pro Thr His Ile Asp
65 70 75 80

Tyr His Gly Ser Phe Ser Glu Tyr Val Ala Ala Lys Trp Asn Ile Gln
85 90 95

Asn Lys Met Thr Ala Ala Asp Phe Leu Val Leu Asn Phe Asn Gln Asp
100 105 110

Leu Ala Lys Asp Leu Thr Ser Lys Thr Glu Ala Thr Val Val Pro Phe
115 120 125

Ser Thr Leu Glu Lys Val Asp Gly Ala Tyr Leu Glu Asp Gly Gln Leu
130 135 140

Tyr Phe Arg Gly Glu Val Val Met Ala Ala Asn Glu Ile Gly Val Pro
145 150 155 160

Gly Ser His Asn Val Glu Asn Ala Leu Ala Thr Ile Ala Val Ala Lys
165 170 175

Leu Arg Asp Val Asp Asn Gln Thr Ile Lys Glu Thr Leu Ser Ala Phe
180 185 190

Gly Gly Val Lys His Arg Leu Gln Phe Val Asp Asp Ile Lys Gly Val
195 200 205

Lys Phe Tyr Asn Asp Ser Lys Ser Thr Asn Ile Leu Ala Thr Gln Lys
210 215 220

Ala Leu Ser Gly Phe Asp Asn Ser Lys Val Val Leu Ile Ala Gly Gly
225 230 235 240

Leu Asp Arg Gly Asn Glu Phe Asp Glu Leu Val Pro Asp Ile Thr Gly
245 250 255

Leu Lys Lys Met Val Ile Leu Gly Gln Ser Ala Glu Arg Val Lys Arg
260 265 270

Ala Ala Asp Lys Ala Gly Val Ala Tyr Val Glu Ala Thr Asp Ile Ala
275 280 285

Asp Ala Thr Arg Lys Ala Tyr Glu Leu Ala Thr Gln Gly Asp Val Val
290 295 300

Leu Leu Ser Pro Ala Asn Ala Ser Trp Asp Met Tyr Ala Asn Phe Glu
305 310 315 320

Val Arg Gly Asp Leu Phe Ile Asp Thr Val Ala Glu Leu Lys Glu
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

AAGTTCATCG AAGATGGTTG GGAAGTCCAC TATATCGGGG ACAAGTGTGG TATCGAACAC 60

CAAGAAATCC TTAAGTCAGG TTTGGATGTC ACCTTCCATT CTATTGCGAC TGGAAAATTG 120

CGTCGCTATT TCTCTTGGCA AAATATGCTG GACGTCTTCA AAGTTGGTTG GGGAATTGTC 180

CAATCGCTCT TTATCATGTT GCGACTGCGT CCACAGACCC TTTTTTCAAA GGGGGGCTTT 240

GTCTCAGTAC CGCCTGTTAT CGCTGCGCGT GTGTCAGGAG TGCCTGTCTT TATTCACGAA 300

TCTGACCTGT CTATGGGCTT GGCCAATAAA ATCGCCTATA AATTTGCGAC TAAGATGTAT 360

TCAACCTTTG AACAAGCTTC GAGTTTGGCT AAGGTTGAGC ATGTGGGAGC GG 412

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

Ser Ser Ser Lys Met Val Gly Lys Ser Thr Ile Ser Gly Thr Ser Val
1 5 10 15

Val Ser Asn Thr Lys Lys Ser Leu Ser Gln Val Trp Met Ser Pro Ser
20 25 30

Ile Leu Leu Arg Leu Glu Asn Cys Val Ala Ile Ser Leu Gly Lys Ile
35 40 45

Cys Trp Thr Ser Ser Lys Leu Val Gly Glu Leu Ser Asn Arg Ser Leu
50 55 60

Ser Cys Cys Asp Cys Val His Arg Pro Phe Phe Gln Arg Gly Ala Leu
65 70 75 80

Ser Gln Tyr Arg Leu Leu Ser Leu Arg Val Cys Gln Glu Cys Leu Ser
85 90 95

Leu Phe Thr Asn Leu Thr Cys Leu Trp Ala Trp Pro Ile Lys Ser Pro
100 105 110

Ile Asn Leu Arg Leu Arg Cys Ile Gln Pro Leu Asn Lys Leu Arg Val
115 120 125



Trp Leu Arg Leu Ser Met Trp Glu Arg 
130 135 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

ATCGCTAGCT AGTGAAATGC AAGAAAGTAC ACGTAAATTC AAGGTTACTG CTGACCTAAC 60

AGATGCCGGT GTTGGAACGA TTGAAGTTCC TTTGAGCATT GAAGATTTAC CCAATGGGCT 120

GACCGCTGTG GCGACTCCGC AAAAAATTAC AGTCAAGATT GGTAAGAAGG CTCAGAAGGA 180

TAAGGTAAAG ATTGTACCAG AGATTGACCC TAGTCAAATT GATAGTCGGG TACAAATTGA 240

AAATGTCATG GTGTCAGATA AAGAAGTGTC TATTACGAGT GACCAAGAGA CATTGGATAG 300

AATTGATAAG ATTATCGCTG TTTTGCCAAC TAGCGAACGT ATAACAGGTA ATTACAGTGG 360

TTCAGTACCT TTGCAGGCAA TCGACCGCAA TGGTGTTGTC TTACCGGCAG TTATCACTCC 420

GTTTGATACA ATAATGAAGG TGACTACAAA ACCAGTAGCA CCAAGTTCAA GCACATCAAA 480

TTCAAGTACA AGCAGTTCAT CGGAGACATC TTCGTCAACG AAAGCAACTA GTTCAAAAAC 540
GAAT 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

Ser Leu Ala Ser Glu Met Gln Glu Ser Thr Arg Lys Phe Lys Val Thr
1 5 10 15

Ala Asp Leu Thr Asp Ala Gly Val Gly Thr Ile Glu Val Pro Leu Ser
20 25 30

Ile Glu Asp Leu Pro Asn Gly Leu Thr Ala Val Ala Thr Pro Gln Lys
35 40 45

Ile Thr Val Lys Ile Gly Lys Lys Ala Gln Lys Asp Lys Val Lys Ile
50 55 60

Val Pro Glu Ile Asp Pro Ser Gln Ile Asp Ser Arg Val Gln Ile Glu
65 70 75 80

Asn Val Met Val Ser Asp Lys Glu Val Ser Ile Thr Ser Asp Gln Glu
85 90 95

Thr Leu Asp Arg Ile Asp Lys Ile Ile Ala Val Leu Pro Thr Ser Glu
100 105 110

Arg Ile Thr Gly Asn Tyr Ser Gly Ser Val Pro Leu Gln Ala Ile Asp
115 120 125

Arg Asn Gly Val Val Leu Pro Ala Val Ile Thr Pro Phe Asp Thr Ile
130 135 140

Met Lys Val Thr Thr Lys Pro Val Ala Pro Ser Ser Ser Thr Ser Asn
145 150 155 160

Ser Ser Thr Ser Ser Ser Ser Glu Thr Ser Ser Ser Thr Lys Ala Thr 
165 170 175 

Ser Ser Lys Thr Asn 
180 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1267 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

GCACCAGATG GGGCACAAGG TTCAGGGATC AGATGTTGAA AAGTACTACT TTACCCAACG 60

CGGTCTTGAG CAGGCAGGAA TTACCATTCT TCCTTTTGAT GAAAAAAATC TAGACGGTGA 120

TATGGAAATT ATCGCTGGAA ATGCCTTTCG TCCAGATAAC AACGTCGAAA TTGCCTATGC 180

GGACCAAAAT GGTATCAGCT ACAAACGTTA CCATGAGTTT CTAGGTAGCT TTATGCGTGA 240

CTTTGTTAGC ATGGGAGTAG CAGGAGCACA TGGAAAAACT TCAACGACAG GTATGTTGTC 300

TCATGTCTTG TCTCACATTA CAGATACCAG CTTCTTGATT GGAGATGGGA CAGGTCGTGG 360

TTCGGCCAAT GCCAAATATT TTGTCTTTGA ATCTGACGAA TATGAGCGTC ACTTCATGCC 420

TTACCACCCA GAATACTCTA TTATCACCAA CATTGACTTT GACCATCCAG ATTATTTCAC 480

AAGTCTCGAG GATGTTTTTA ATGCCTTTAA CGACTATGCC AAACAAATCA CCAAGGGTCT 540

TTTTGTCTAT GGTGAAGATG CTGAATTGCG TAAGATTACG TCTGATGCAC CAATTTATTA 600

TTATGGTTTT GAAGCTGAAG GCAATGACTT TGTAGCTAGT GATCTTCTTC GTTCAATAAC 660

TGGTTCAACC TTCACCGTTC ATTTCCGTGG ACAAAACTTG GGGCAATTCC ACATTCCAAC 720

CTTTGGTCGT CACAATATCA TGAATGCGAC AGCCGTTATT GGTCTTCTTT ACACAGCAGG 780

ATTTGATTTG AACTTGGTGC GTGAGCACTT GAAAACATTT GCCGGTGTTA AACGTCGTTT 840

CACTGAGAAA ATTGTCAATG ATACAGTGAT TATCGATGAC TTTGCCCACC ATCCAACAGA 900

AATTATTGCG ACCTTGGATG CGGCTCGTCA GAAATACCCA AGCAAGGAAA TTGTAGCAGT 960

CTTTCAACCG CATACCTTTA CAAGAACCAT TGCCTTGTTG GACGACTTTG CCCATGCTTT 1020

AAACCAAGCA GATGCTGTTT ATCTAGCGCA AATTTATGGC TCGGCTCGTG AAGTAGATCA 1080

TGGTGACGTT AAGGTAGAAG ACCTAGCCAA CAAAATCAAC AAAAAACACC AAGTGATTAC 1140

TGTTGAAAAT GTTTCTCCAC TCCTAGACCA TGACAATGCT GTTTACGTCT TTATGGGAGC 1200

AGGAGACATC CAAACCTATG AATACTCATT TGAGCGTCTC TTGTCTAACT TGACAAGCAA 1260
TGTTCAA 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

His Gln Met Gly His Lys Val Gln Gly Ser Asp Val Glu Lys Tyr Tyr
1 5 10 15

Phe Thr Gln Arg Gly Leu Glu Gln Ala Gly Ile Thr Ile Leu Pro Phe
20 25 30

Asp Glu Lys Asn Leu Asp Gly Asp Met Glu Ile Ile Ala Gly Asn Ala
35 40 45

Phe Arg Pro Asp Asn Asn Val Glu Ile Ala Tyr Ala Asp Gln Asn Gly
50 55 60

Ile Ser Tyr Lys Arg Tyr His Glu Phe Leu Gly Ser Phe Met Arg Asp
65 70 75 80

Phe Val Ser Met Gly Val Ala Gly Ala His Gly Lys Thr Ser Thr Thr
85 90 95

Gly Met Leu Ser His Val Leu Ser His Ile Thr Asp Thr Ser Phe Leu
100 105 110

Ile Gly Asp Gly Thr Gly Arg Gly Ser Ala Asn Ala Lys Tyr Phe Val
115 120 125

Phe Glu Ser Asp Glu Tyr Glu Arg His Phe Met Pro Tyr His Pro Glu
130 135 140

Tyr Ser Ile Ile Thr Asn Ile Asp Phe Asp His Pro Asp Tyr Phe Thr
145 150 155 160

Ser Leu Glu Asp Val Phe Asn Ala Phe Asn Asp Tyr Ala Lys Gln Ile
165 170 175

Thr Lys Gly Leu Phe Val Tyr Gly Glu Asp Ala Glu Leu Arg Lys Ile
180 185 190

Thr Ser Asp Ala Pro Ile Tyr Tyr Tyr Gly Phe Glu Ala Glu Gly Asn
195 200 205

Asp Phe Val Ala Ser Asp Leu Leu Arg Ser Ile Thr Gly Ser Thr Phe
210 215 220

Thr Val His Phe Arg Gly Gln Asn Leu Gly Gln Phe His Ile Pro Thr
225 230 235 240

Phe Gly Arg His Asn Ile Met Asn Ala Thr Ala Val Ile Gly Leu Leu
245 250 255

Tyr Thr Ala Gly Phe Asp Leu Asn Leu Val Arg Glu His Leu Lys Thr
260 265 270

Phe Ala Gly Val Lys Arg Arg Phe Thr Glu Lys Ile Val Asn Asp Thr
275 280 285

Val Ile Ile Asp Asp Phe Ala His His Pro Thr Glu Ile Ile Ala Thr
290 295 300

Leu Asp Ala Ala Arg Gln Lys Tyr Pro Ser Lys Glu Ile Val Ala Val
305 310 315 320

Phe Gln Pro His Thr Phe Thr Arg Thr Ile Ala Leu Leu Asp Asp Phe
325 330 335

Ala His Ala Leu Asn Gln Ala Asp Ala Val Tyr Leu Ala Gln Ile Tyr
340 345 350

Gly Ser Ala Arg Glu Val Asp His Gly Asp Val Lys Val Glu Asp Leu
355 360 365

Ala Asn Lys Ile Asn Lys Lys His Gln Val Ile Thr Val Glu Asn Val
370 375 380

Ser Pro Leu Leu Asp His Asp Asn Ala Val Tyr Val Phe Met Gly Ala
385 390 395 400

Gly Asp Ile Gln Thr Tyr Glu Tyr Ser Phe Glu Arg Leu Leu Ser Asn
405 410 415

Leu Thr Ser Asn Val Gln
420 

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3121 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TAGTTTTATT 60

GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TGACTAGCAT 120

GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TATCTGCCTA 180

TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AAATCGAAGG 240

TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TTTCAAGGAC 300

AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AAACATCAGA 360

TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GTTTACAAGG 420

TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG ACAATCTATC 480

TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GAGAATCTGT 540

AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TGCAGAGTGC 600

GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CATTGGAAAC 660

CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG ACTTACCAGT 720

CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GTGAAGCTGC 780

AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA ACGAAAGGCA CGCAAGAGCC 840

AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AACCGGTAGC 900

GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AAGAACTTCC 960

GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT ATACAACAGA 1020

AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AAGGGCAAGC 1080

AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TAGAAACTAA 1140

AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GAACACTTGT 1200

GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AAAAATCTAT 1260

AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AAACGCAAGT 1320

TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AAGAGCAAGT 1380

AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CTTATAATTT 1440

GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TAGAGTATAA 1500

GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA ATGATCGTTA 1560

TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TTGTAAAAGT 1620

GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AAAATACGGA 1680

TGGAACGTAT AAAGTGACGG TAGCCGTTGA [illegible] GAAGAAGGTA CAGACGGTTA 1740

CAAAGATGAT TACACATTTA CTGTAGCTAA [illegible] [illegible] GAGTTTACAC 1800

ATCCTTTAAA CAGCTGGTAA CAGCCATGCA [illegible] [illegible] ATACATTGGC 1860

TTCAGATATG ACCGCAGATG AGGTGAGCTT [illegible] [illegible] [illegible] 1920

TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA [illegible] [illegible] 1980

GAAGAAACCA TTATTTGATA CATTAAATGG [illegible] [illegible] ATATTAAAAC 2040

TGTTTCTGCT GATAGTAAAG AAAATGTCGC [illegible] [illegible] [illegible] 2100

TATTAATAAT GTTGCAGTAG AAGGAAAAAT [illegible] [illegible] [illegible] 2160

AGCGAGCGCA ACAAATACAG TGATAGAAAA [illegible] [illegible] [illegible] 2220

TCACCAGGAC AGTAATAAAA ATGATACTGG [illegible] [illegible] [illegible] 2280

TTCGAGAGTT AATAAAGTTA GGGTAGATGC [illegible] [illegible] [illegible] 2340

CCAAACAGCT GGAGGGATAG TAGGTAGATT [illegible] [illegible] [illegible] 2400

TGCTACTGGA GAAATACGAA ATGGTCAAGG [illegible] [illegible] [illegible] 2460

TACGTGGCAA AACGGTCGAG TAAATAATGT [illegible] [illegible] [illegible] 2520

TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA [illegible] [illegible] 2580

TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA [illegible] [illegible] 2640

TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG [illegible] [illegible] 2700

AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG [illegible] 2760

CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT ATGGTAACAA 2820

AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CGATGAAAGA 2880

TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TTATGTTACA 2940

TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TCATAAACAG 3000

TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AAGCATTTGT 3060

TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TAACACTTAA 3120

C 3121



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu
1 5 10 15

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val
20 25 30

His Phe Leu Leu Leu Thr Ser Met Gly Val Gln Leu Leu Pro Ala Ser
35 40 45

Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Ala Tyr Asn Ser Gln Leu
50 55 60

Ser Ile Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys Ile Glu Gly
65 70 75 80

Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Lys Gln Asp Asn Thr Glu
85 90 95

Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Ala Gln Arg Asp Ser Gln
100 105 110

Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu
115 120 125

Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gln Gly Glu Ala Ser Gly
130 135 140

Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Ala Ala Asp Asn Leu Ser
145 150 155 160

Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gln Asn Pro Asp His Lys
165 170 175

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gln Gly Asn Pro Val
180 185 190

Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Glu Val Leu Ala Thr Thr
195 200 205

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gln
210 215 220

Glu Pro Gly His Glu Gly Glu Ala Ala Val Arg Glu Asp Leu Pro Val
225 230 235 240

Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gln Gly Pro Gly His Glu
245 250 255

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu
260 265 270

Ala Thr Lys Gly Thr Gln Glu Pro Gly His Glu Gly Lys Ala Thr Val
275 280 285

Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Val Ala Thr Lys Gly Thr
290 295 300

Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro
305 310 315 320

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu Ile Gln Asn Ile Pro
325 330 335

Tyr Thr Thr Glu Glu He Gin Asp Pro Thr Leu Leu Lys Asn Arg Arg 
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340 345 350 

Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Arg Thr Ile Gln Tyr Glu
355 360 365

Asp Tyr Ile Val Asn Gly Asn Val Val Glu Thr Lys Glu Val Ser Arg
370 375 380

Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val
385 390 395 400

Lys Val Lys Pro Thr Val Glu Ile Thr Asn Leu Thr Lys Val Glu Asn
405 410 415

Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Ile Asp Thr Thr Ser Ala
420 425 430

Tyr Val Ser Ala Lys Thr Gln Val Phe His Gly Asp Lys Leu Val Lys
435 440 445

Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gln Val Ile Ser Gly Leu
450 455 460

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu Thr Tyr Asn Leu
465 470 475 480

Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Ser Thr Gln Asp Phe Gln
485 490 495

Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Ile Asp Ser Val Glu Leu
500 505 510

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg Tyr Leu Ser Leu Ser Glu
515 520 525

Ala Pro Thr Asp Thr Ala Lys Tyr Phe Val Lys Val Lys Ser Asp Arg
530 535 540

Phe Lys Glu Met Tyr Leu Pro Val Lys Ser Ile Thr Glu Asn Thr Asp
545 550 555 560

Gly Thr Tyr Lys Val Thr Val Ala Val Asp Gln Leu Val Glu Glu Gly
565 570 575

Thr Asp Gly Tyr Lys Asp Asp Tyr Thr Phe Thr Val Ala Lys Ser Lys
580 585 590

Ala Glu Gln Pro Gly Val Tyr Thr Ser Phe Lys Gln Leu Val Thr Ala
595 600 605

Met Gln Ser Asn Leu Ser Gly Val Tyr Thr Leu Ala Ser Asp Met Thr
610 615 620

Ala Asp Glu Val Ser Leu Gly Asp Lys Gln Thr Ser Tyr Leu Thr Gly
625 630 635 640

Ala Phe Thr Gly Ser Leu Ile Gly Ser Asp Gly Thr Lys Ser Tyr Ala
645 650 655

Ile Tyr Asp Leu Lys Lys Pro Leu Phe Asp Thr Leu Asn Gly Ala Thr
660 665 670

Val Arg Asp Leu Asp Ile Lys Thr Val Ser Ala Asp Ser Lys Glu Asn
675 680 685

Val Ala Ala Leu Ala Lys Ala Ala Asn Ser Ala Asn Ile Asn Asn Val
690 695 700

Ala Val Glu Gly Lys Ile Ser Gly Ala Lys Ser Val Ala Gly Leu Val
705 710 715 720

Ala Ser Ala Thr Asn Thr Val Ile Glu Asn Ser Ser Phe Thr Gly Lys
725 730 735

Leu Ile Ala Asn His Gln Asp Ser Asn Lys Asn Asp Thr Gly Gly Ile
740 745 750

Val Gly Asn Ile Thr Gly Asn Ser Ser Arg Val Asn Lys Val Arg Val
755 760 765

Asn Ala Leu Ile Ser Thr Asn Ala Arg Asn Asn Asn Gln Thr Ala Gly
770 775 780

Gly Ile Val Gly Arg Leu Glu Asn Gly Ala Leu Ile Ser Asn Ser Val
785 790 795 800

Ala Thr Gly Glu Ile Arg Asn Gly Gln Gly Tyr Ser Arg Val Gly Gly
805 810 815

Ile Val Gly Ser Thr Trp Gln Asn Gly Arg Val Asn Asn Val Val Ser
820 825 830

Asn Val Asp Val Gly Asp Gly Tyr Val Ile Thr Gly Asp Gln Tyr Ala
835 840 845

Ala Ala Asp Val Lys Asn Ala Ser Thr Ser Val Asp Asn Arg Lys Ala
850 855 860

Asp Arg Phe Ala Thr Lys Leu Ser Lys Asp Gln Ile Asp Ala Lys Val
865 870 875 880

Ala Asp Tyr Gly Ile Thr Val Thr Leu Asp Asp Thr Gly Gln Asp Leu
885 890 895

Lys Arg Asn Leu Arg Glu Val Asp Tyr Thr Arg Leu Asn Lys Ala Glu
900 905 910

Ala Glu Arg Lys Val Ala Tyr Ser Asn Ile Glu Lys Leu Met Pro Phe
915 920 925

Tyr Asn Lys Asp Leu Val Val His Tyr Gly Asn Lys Val Ala Thr Thr
930 935 940

Asp Lys Leu Tyr Thr Thr Glu Leu Leu Asp Val Val Pro Met Lys Asp
945 950 955 960

Asp Glu Val Val Thr Asp Ile Asn Asn Lys Lys Asn Ser Ile Asn Lys
965 970 975

Val Met Leu His Phe Lys Asp Asn Thr Val Glu Tyr Leu Asp Val Thr
980 985 990

Phe Lys Glu Asn Phe Ile Asn Ser Gln Val Ile Glu Tyr Asn Val Thr
995 1000 1005

Gly Lys Glu Tyr Ile Phe Thr Pro Glu Ala Phe Val Ser Asp Tyr Thr
1010 1015 1020

Ala Ile Thr Asn Asn Val Leu Ser Asp Leu Gln Asn Val Thr Leu Asn
1025 1030 1035 1040

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:



TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TAGTTTTATT 60

GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TGACTAGCAT 120

GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TATCTGCCTA 180

TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AAATCGAAGG 240

TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TTTCAAGGAC 300

AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AAACATCAGA 360

TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GTTTACAAGG 420

TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG ACAATCTATC 480

TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GAGAATCTGT 540

AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TGCAGAGTGC 600

GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CATTGGAAAC 660

CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG ACTTACCAGT 720

CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GTGAAGCTGC 780

AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA ACGAAAGGCA CGCAAGAGCC 840

AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AACCGGTAGC 900

GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AAGAACTTCC 960

GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT ATACAACAGA 1020

AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AAGGGCAAGC 1080

AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TAGAAACTAA 1140

AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GAACACTTGT 1200

GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AAAAATCTAT 1260

AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AAACGCAAGT 1320

TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AAGAGCAAGT 1380

AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CTTATAATTT 1440

GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TAGAGTATAA 1500

GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA ATGATCGTTA 1560

TCGTAGA 1567

(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu
1 5 10 15

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val
20 25 30

His Phe Leu Leu Leu Thr Ser Met Gly Val Gln Leu Leu Pro Ala Ser
35 40 45

Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Ala Tyr Asn Ser Gln Leu
50 55 60

Ser Ile Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys Ile Glu Gly
65 70 75 80

Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Lys Gln Asp Asn Thr Glu
85 90 95

Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Ala Gln Arg Asp Ser Gln
100 105 110

Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu
115 120 125

Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gln Gly Glu Ala Ser Gly
130 135 140

Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Ala Ala Asp Asn Leu Ser
145 150 155 160

Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gln Asn Pro Asp His Lys
165 170 175

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gln Gly Asn Pro Val
180 185 190

Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Glu Val Leu Ala Thr Thr
195 200 205

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gln
210 215 220

Glu Pro Gly His Glu Gly Glu Ala Ala Val Arg Glu Asp Leu Pro Val
225 230 235 240

Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gln Gly Pro Gly His Glu
245 250 255

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu
260 265 270

Ala Thr Lys Gly Thr Gln Glu Pro Gly His Glu Gly Lys Ala Thr Val
275 280 285

Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Val Ala Thr Lys Gly Thr
290 295 300

Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro
305 310 315 320

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu Ile Gln Asn Ile Pro
325 330 335

Tyr Thr Thr Glu Glu Ile Gln Asp Pro Thr Leu Leu Lys Asn Arg Arg
340 345 350

Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Arg Thr Ile Gln Tyr Glu
355 360 365

Asp Tyr Ile Val Asn Gly Asn Val Val Glu Thr Lys Glu Val Ser Arg
370 375 380

Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val
385 390 395 400

Lys Val Lys Pro Thr Val Glu Ile Thr Asn Leu Thr Lys Val Glu Asn
405 410 415

Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Ile Asp Thr Thr Ser Ala
420 425 430

Tyr Val Ser Ala Lys Thr Gln Val Phe His Gly Asp Lys Leu Val Lys
435 440 445

Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gln Val Ile Ser Gly Leu
450 455 460

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu Thr Tyr Asn Leu
465 470 475 480

Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Ser Thr Gln Asp Phe Gln
485 490 495

Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Ile Asp Ser Val Glu Leu
500 505 510

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg
515 520

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1561 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:



TCGTAGATAT


TTAAGTC T AA 


(j 1 Vj AAVjC VjVw v- 


vji-iV., X Vjrt X .nv— vJ 


GC T AAAT AC T 


TTGTAAAAGT 


60 


GAAATCAGAT 


CGCTTCAAAG 


11 ■» * mrrPT^ /~ , PT* 

AAATCjTAL-L. 1 




TC TATT AC AG 


AAAAT AC GG A 


120 


TGGAAC GTAT 


AAAGTGACGG 


TAGCCbjl IvjA 


rpp A A pttptp 


GAAGAAGGTA 


CAGACGGTTA 


180 


CAAAGATGAT 


TACACATTTA 


CTGTAGCTAA 


A TP T a a a p_p A 


PAGPAAPPAG 


GAGTTTACAC 


240 


ATC C TTT AAA 


CAGCTGGTAA 


C AGC C ATGC A 


71 A P 1 f r " ' A A TfTC 


1^1 VjO X VJ X V— J. 


ATACATTGGC 


300 


TTCAGATATG 


ACCGCAGATG 


AGGTGAGCTT 


ACjOjC bA 1 AAb 


p a pa p a a P4tt 

v^ .i-ivj.rt.V_ /■irt.vj j. x 


ATPTCAPAGG 


360 


TGCATTTACA 


GGGAGCTTGA 


TCGGTTCTGA 


mo PAAPAAAA 


rpp r,ff A TP4P P A 


TTTATGATTT 


420 


GAAGAAACCA 


TTATTTGATA 


CATTAAATGG 


rpPPTTi O A /""HPT 1 
1 vjL, 1 Av^iivj 1 1 


A P A (Z A TTT PP4 
rtorivjri. ill vjvj 


ATATTAAAAC 


480 


TGTTTCTGCT 


GATAGTAAAG 


AAAATGTCGC 


ACjL. vjL. I VjvjL, vj 


a appp APPPA 

,rt^vjVjv_..rt.vjv«- vj.ii. 


AT AGP GC G AA 


540 


TATTAATAAT 


GTTGCAGTAG 


AAGGAAAAAT 


LTLACjVj 1 vjL,vj 


AAA TPTPtTTP 


C GGG ATT AGT 


600 


AGCGAGCGCA 


ACAAATACAG 


TGATAGAAAA 


L,Avjv_ 1 U\j 111 


ap Annn a aap 


TTATCGCAAA 


660 


TCACCAGGAC 


AGTAATAAAA 


ATGATACTGG 


* pPTi ATA PT A 
ACjVjAA 1 AVj 1 A 


rzr.1T A AT AT A A 
VjVj X X .rt x nn 


CAGGAAATAG 


720 


TTCGAGAGTT 


AATAAAGTTA 


GGGTAGATGC 


/-imrpA A fPPTPT 

t_ 1 inAl L1L1 


APT A ATPP AP 

rt.V_ X rt_tt. X uLA> — • 


GCAATAATAA 


780 


CCAAACAGCT 


GGAGGGATAG 


TAGGTAGATT 


A A A A TlTPPT 

AbAAAA 1 vjVj 1 


PP ATTPAT AT 


C T AATTC GGT 


840 


TGCTACTGGA 


GAAATACGAA 


ATGGTCAAGG 


ATATTL 1 AljA 


P TP PP A PP A A 
ulL vjVj.tt.vjvj.rt-rt. 


TAPTAGGATP 


900 


TACGTGGCAA 


AACGGTCGAG 


TAAATAATGT 


TGTGAbi 1 AAL 


PT a P A TPTTP 
Vj 1 AoA X Vj X X Vj 


PAPATGGTTA 


960 


TGTTATCACC 


GGTGATCAAT 


AC GC AGC AGC 


AGATGTCjAAA 


A A TPP A A PT A 


P ATP APTTGA 
v_iri. x v i^.\j j. x vjri. 


1020 


TAATAGAAAA 


GCAGACAGAT 


TC GC T AC AAA 


ATTATCAAAA 


PAPP7A A A TAP 
bAL AAA 1 AVj 


APPPPAAAGT 


1080 


TGCTGATTAT 


GGAATCACAG 


TAACTCTTGA 


TGAT AC 1 VjVjVj 


P A A P A TTT A A 


AAPGTAATCT 


1140 


AAGAGAAGTT 


GATTATACAA 


GACTAAATAA 


AGCAGAAGCT 


/-^AAA/" , AAAAP 

GAAAbAAAAb 


T 1 A PPTT AT AP 


1200 


CAACATAGAA 


AAACTGATGC 


CATTCTACAA 


TAAAGACCTA 


GTAGTTCACT 


A rrp O rp A A (~* A A 

ATGo x AAL.AA 


1 ocn 


AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CGATGAAAGA 1320

TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TTATGTTACA 1380

TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TCATAAACAG 1440

TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AAGCATTTGT 1500

TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TAACACTTAA 1560

C 1561



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 520 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

Arg Arg Tyr Leu Ser Leu Ser Glu Ala Pro Thr Asp Thr Ala Lys Tyr
1 5 10 15

Phe Val Lys Val Lys Ser Asp Arg Phe Lys Glu Met Tyr Leu Pro Val
20 25 30

Lys Ser Ile Thr Glu Asn Thr Asp Gly Thr Tyr Lys Val Thr Val Ala
35 40 45

Val Asp Gln Leu Val Glu Glu Gly Thr Asp Gly Tyr Lys Asp Asp Tyr
50 55 60

Thr Phe Thr Val Ala Lys Ser Lys Ala Glu Gln Pro Gly Val Tyr Thr
65 70 75 80

Ser Phe Lys Gln Leu Val Thr Ala Met Gln Ser Asn Leu Ser Gly Val
85 90 95

Tyr Thr Leu Ala Ser Asp Met Thr Ala Asp Glu Val Ser Leu Gly Asp
100 105 110

Lys Gln Thr Ser Tyr Leu Thr Gly Ala Phe Thr Gly Ser Leu Ile Gly
115 120 125

Ser Asp Gly Thr Lys Ser Tyr Ala Ile Tyr Asp Leu Lys Lys Pro Leu
130 135 140

Phe Asp Thr Leu Asn Gly Ala Thr Val Arg Asp Leu Asp Ile Lys Thr
145 150 155 160

Val Ser Ala Asp Ser Lys Glu Asn Val Ala Ala Leu Ala Lys Ala Ala
165 170 175

Asn Ser Ala Asn Ile Asn Asn Val Ala Val Glu Gly Lys Ile Ser Gly
180 185 190

Ala Lys Ser Val Ala Gly Leu Val Ala Ser Ala Thr Asn Thr Val Ile
195 200 205

Glu Asn Ser Ser Phe Thr Gly Lys Leu Ile Ala Asn His Gln Asp Ser
210 215 220

Asn Lys Asn Asp Thr Gly Gly Ile Val Gly Asn Ile Thr Gly Asn Ser
225 230 235 240

Ser Arg Val Asn Lys Val Arg Val Asp Ala Leu Ile Ser Thr Asn Ala
245 250 255

Arg Asn Asn Asn Gln Thr Ala Gly Gly Ile Val Gly Arg Leu Glu Asn
260 265 270

Gly Ala Leu Ile Ser Asn Ser Val Ala Thr Gly Glu Ile Arg Asn Gly
275 280 285

Gln Gly Tyr Ser Arg Val Gly Gly Ile Val Gly Ser Thr Trp Gln Asn
290 295 300

Gly Arg Val Asn Asn Val Val Ser Asn Val Asp Val Gly Asp Gly Tyr
305 310 315 320

Val Ile Thr Gly Asp Gln Tyr Ala Ala Ala Asp Val Lys Asn Ala Ser
325 330 335

Thr Ser Val Asp Asn Arg Lys Ala Asp Arg Phe Ala Thr Lys Leu Ser
340 345 350

Lys Asp Gln Ile Asp Ala Lys Val Ala Asp Tyr Gly Ile Thr Val Thr
355 360 365

Leu Asp Asp Thr Gly Gln Asp Leu Lys Arg Asn Leu Arg Glu Val Asp
370 375 380

Tyr Thr Arg Leu Asn Lys Ala Glu Ala Glu Arg Lys Val Ala Tyr Ser
385 390 395 400

Asn Ile Glu Lys Leu Met Pro Phe Tyr Asn Lys Asp Leu Val Val His
405 410 415

Tyr Gly Asn Lys Val Ala Thr Thr Asp Lys Leu Tyr Thr Thr Glu Leu
420 425 430

Leu Asp Val Val Pro Met Lys Asp Asp Glu Val Val Thr Asp Ile Asn
435 440 445

Asn Lys Lys Asn Ser Ile Asn Lys Val Met Leu His Phe Lys Asp Asn
450 455 460

Thr Val Glu Tyr Leu Asp Val Thr Phe Lys Glu Asn Phe Ile Asn Ser
465 470 475 480

Gln Val Ile Glu Tyr Asn Val Thr Gly Lys Glu Tyr Ile Phe Thr Pro
485 490 495

Glu Ala Phe Val Ser Asp Tyr Thr Ala Ile Thr Asn Asn Val Leu Ser
500 505 510

Asp Leu Gln Asn Val Thr Leu Asn
515 520

(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 850 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CTTTGGTTTT GAAGGAAGTA AGCGTGGACA ATTTGCTGTA GAAGGAATCA ATCAACTTCG 60

TGAGCATGTA GACACTCTAT TGATTATCTC AAACAACAAT TTGCTTGAAA TTGTTGATAA 120

GAAAACACCG CTTTTGGAGG CTCTTAGCGA AGCGGATAAC GTTCTTCGTC AAGGTGTTCA 180

AGGGATTACC GATTTGATTA CCAATCCAGG ATTGATTAAC CTTGACTTTG CCGATGTGAA 240

AACGGTAATG GCAAACAAAG GGAATGCTCT TATGGGTATT GGTATCGGTA GTGGAGAAGA 300

ACGTGTGGTA GAAGCGGCAC GTAAGGCAAT CTATTCACCA CTTCTTGAAA CAACTATTGA 360

CGGTGCTGAG GATGTTATCG TCAACGTTAC TGGTGGTCTT GACTTAACCT TGATTGAGGC 420

AGAAGAGGCT TCACAAATTG TGAACCAGGC AGCAGGTCAA GGAGTGAACA TCTGGCTCGG 480

TACTTCAATT GATGAAAGTA TGCGTGATGA AATTCGTGTA ACAGTTGTTG CAACGGGTGT 540

TCGTCAAGAC CGCGTAGAAA AGGTTGTGGC TCCACAAGCT AGATCTGCTA CTAACTACCG 600

TGAGACAGTG AAACCAGCTC ATTCACATGG CTTTGATCGT CATTTTGATA TGGCAGAAAC 660

AGTTGAATTG CCAAAACAAA ATCCACGTCG TTTGGAACCA ACTCAGGCAT CTGCTTTTGG 720

TGATTGGGAT CTTCGCCGTG AATCGATTGT TCGTACAACA GATTCAGTCG TTTCTCCAGT 780

CGAGCGCTTT GAAGCCCCAA TTTCACAAGA TGAAGATGAA TTGGATACAC CTCCATTTTT 840

CAAAAATCGT 850

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Phe Gly Phe Glu Gly Ser Lys Arg Gly Gln Phe Ala Val Glu Gly Ile
1 5 10 15

Asn Gln Leu Arg Glu His Val Asp Thr Leu Leu Ile Ile Ser Asn Asn
20 25 30

Asn Leu Leu Glu Ile Val Asp Lys Lys Thr Pro Leu Leu Glu Ala Leu
35 40 45

Ser Glu Ala Asp Asn Val Leu Arg Gln Gly Val Gln Gly Ile Thr Asp
50 55 60

Leu Ile Thr Asn Pro Gly Leu Ile Asn Leu Asp Phe Ala Asp Val Lys
65 70 75 80

Thr Val Met Ala Asn Lys Gly Asn Ala Leu Met Gly Ile Gly Ile Gly
85 90 95

Ser Gly Glu Glu Arg Val Val Glu Ala Ala Arg Lys Ala Ile Tyr Ser
100 105 110

Pro Leu Leu Glu Thr Thr Ile Asp Gly Ala Glu Asp Val Ile Val Asn
115 120 125

Val Thr Gly Gly Leu Asp Leu Thr Leu Ile Glu Ala Glu Glu Ala Ser
130 135 140

Gln Ile Val Asn Gln Ala Ala Gly Gln Gly Val Asn Ile Trp Leu Gly
145 150 155 160

Thr Ser Ile Asp Glu Ser Met Arg Asp Glu Ile Arg Val Thr Val Val
165 170 175

Ala Thr Gly Val Arg Gln Asp Arg Val Glu Lys Val Val Ala Pro Gln
180 185 190

Ala Arg Ser Ala Thr Asn Tyr Arg Glu Thr Val Lys Pro Ala His Ser
195 200 205

His Gly Phe Asp Arg His Phe Asp Met Ala Glu Thr Val Glu Leu Pro
210 215 220

Lys Gln Asn Pro Arg Arg Leu Glu Pro Thr Gln Ala Ser Ala Phe Gly
225 230 235 240

Asp Trp Asp Leu Arg Arg Glu Ser Ile Val Arg Thr Thr Asp Ser Val
245 250 255

Val Ser Pro Val Glu Arg Phe Glu Ala Pro Ile Ser Gln Asp Glu Asp
260 265 270

Glu Leu Asp Thr Pro Pro Phe Phe Lys Asn Arg
275 280

(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1051 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



CTACTACCTC TCGAGAGAAA GTGACCTAGA GGTGACCGTT TTTGACCATG AGCAAGGTCA 60

AGCCACCAAG GCCGCAGCAG GAATTATCAG TCCTTGGTTT TCCAAACGCC GTAATAAAGC 120

CTGGTACAAG ATGGCGCGCT TGGGGGCTGA TTTTTATGTG GATTTATTAG CTGATTTAGA 180

GAAATCAGGA CAAGAAATCG ACTTTTACCA GCGTTCGGGA GTCTTTCTCT TGAAAAAGGA 240

TGAATCCAAT TTGGAAGAAC TTTATCAACT GGCCCTCCAG CGCAGAGAAG AATCTCCCTT 300

GATAGGGCAA TTAGCCATTC TGAACCAAGC CTCAGCTAAT GAATTATTCC CTGGTTTGCA 360

GGGATTTGAC CGCCTGCTCT ATGCTTCTGG TGGAGCGAGA GTAGATGGCC AACTTTTAGT 420

GACTCGTTTG CTGGAAGTCA GTCATGTCAA GCTGGTCAAA GAAAAAGTGA CTCTGACACC 480

GTTAGCATCA GGCTACCAGA TTGGTGAAGA GGAGTTTGAG CAGGTTATTT TGGCGACGGG 540

AGCTTGGTTG GGGGACATGT TAGAGCCTTT AGGTTATGAA GTGGATGTCC GTCCTCAAAA 600

AGGACAACTA CGAGATTATC AGCTTGCCCA AGACATGGAA GATTACCCTG TTGTCATGCC 660

AGAAGGGGAG TGGGATTTGA TTCCCTTTGC AGGTGGGAAA TTATCCTTAG GCGCTACCCA 720

CGAAAATGAC ATGGGATTTG ATTTGACGGT AGATGAAACC TTGCTCCAAC AAATGGAGGA 780

GGCCACCTTG ACTCACTATC TGATTTTGGC TGAAGCTACT TCAAAATCTG AGCGTGTTGG 840

AATCCGTGCC TACACCAGTG ATTTCTCTCC TTTCTTTGGG CAGGTGCCTG ACTTAACTGG 900

TGTCTATGCA GCCAGTGGAC TAGGTTCATC AGGCCTCACA ACTGGTCCTA TCATTGGTTA 960

CCATCTAGCC CAACTGATCC AAGACAAGGA GTTGACCTTG GACCCTCTAA ATTACCCAAT 1020

TGAAAACTAT GTCAAACGAG TAAAAAGCGA A 1051



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Tyr Tyr Leu Ser Arg Glu Ser Asp Leu Glu Val Thr Val Phe Asp His
1 5 10 15

Glu Gln Gly Gln Ala Thr Lys Ala Ala Ala Gly Ile Ile Ser Pro Trp
20 25 30

Phe Ser Lys Arg Arg Asn Lys Ala Trp Tyr Lys Met Ala Arg Leu Gly
35 40 45

Ala Asp Phe Tyr Val Asp Leu Leu Ala Asp Leu Glu Lys Ser Gly Gln
50 55 60

Glu Ile Asp Phe Tyr Gln Arg Ser Gly Val Phe Leu Leu Lys Lys Asp
65 70 75 80

Glu Ser Asn Leu Glu Glu Leu Tyr Gln Leu Ala Leu Gln Arg Arg Glu
85 90 95

Glu Ser Pro Leu Ile Gly Gln Leu Ala Ile Leu Asn Gln Ala Ser Ala
100 105 110

Asn Glu Leu Phe Pro Gly Leu Gln Gly Phe Asp Arg Leu Leu Tyr Ala
115 120 125

Ser Gly Gly Ala Arg Val Asp Gly Gln Leu Leu Val Thr Arg Leu Leu
130 135 140

Glu Val Ser His Val Lys Leu Val Lys Glu Lys Val Thr Leu Thr Pro
145 150 155 160

Leu Ala Ser Gly Tyr Gln Ile Gly Glu Glu Glu Phe Glu Gln Val Ile
165 170 175

Leu Ala Thr Gly Ala Trp Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr
180 185 190

Glu Val Asp Val Arg Pro Gln Lys Gly Gln Leu Arg Asp Tyr Gln Leu
195 200 205

Ala Gln Asp Met Glu Asp Tyr Pro Val Val Met Pro Glu Gly Glu Trp
210 215 220

Asp Leu Ile Pro Phe Ala Gly Gly Lys Leu Ser Leu Gly Ala Thr His
225 230 235 240

Glu Asn Asp Met Gly Phe Asp Leu Thr Val Asp Glu Thr Leu Leu Gln
245 250 255

Gln Met Glu Glu Ala Thr Leu Thr His Tyr Leu Ile Leu Ala Glu Ala
260 265 270

Thr Ser Lys Ser Glu Arg Val Gly Ile Arg Ala Tyr Thr Ser Asp Phe
275 280 285

Ser Pro Phe Phe Gly Gln Val Pro Asp Leu Thr Gly Val Tyr Ala Ala
290 295 300

Ser Gly Leu Gly Ser Ser Gly Leu Thr Thr Gly Pro Ile Ile Gly Tyr
305 310 315 320

His Leu Ala Gln Leu Ile Gln Asp Lys Glu Leu Thr Leu Asp Pro Leu
325 330 335

Asn Tyr Pro Ile Glu Asn Tyr Val Lys Arg Val Lys Ser Glu
340 345 350

(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 352 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GCTAGAAAAG ATTGGAGCTG ACTCGGTTAT 60 

CTCGCCAGAG TATGAAATGG GGCAGTCTCT AGCACAGACC ATTCTTTTCC ATAATAGTGT 120

TGATGTCTTT CAGTTGGATA AAAATGTGTC TATCGTGGAG ATGAAAATTC CTCAGTCTTG 180

GGCAGGTCAA AGTCTGAGTA AATTAGACCT CCGTGGCAAA TACAATCTGA ATATTTTGGG 240

TTTCCGAGAG CAGGAAAATT CCCCATTGGA TGTTGAATTT GGACCAGATG ACCTCTTGAA 300

AGCAGATACC TATATTTTGG CAGTCATCAA CAACCAGTAT TTGGATACCC TA 352

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Lys Val Lys Ser Gln Thr Ala Lys Lys Val Leu Glu Lys Ile Gly Ala
1 5 10 15

Asp Ser Val Ile Ser Pro Glu Tyr Glu Met Gly Gln Ser Leu Ala Gln
20 25 30

Thr Ile Leu Phe His Asn Ser Val Asp Val Phe Gln Leu Asp Lys Asn
35 40 45

Val Ser Ile Val Glu Met Lys Ile Pro Gln Ser Trp Ala Gly Gln Ser
50 55 60

Leu Ser Lys Leu Asp Leu Arg Gly Lys Tyr Asn Leu Asn Ile Leu Gly
65 70 75 80

Phe Arg Glu Gln Glu Asn Ser Pro Leu Asp Val Glu Phe Gly Pro Asp
85 90 95

Asp Leu Leu Lys Ala Asp Thr Tyr Ile Leu Ala Val Ile Asn Asn Gln
100 105 110

Tyr Leu Asp Thr Leu
115

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

TGACGGGTCT CAGGATCAGA CTCAGGAAAT CGCTGAGTGT TTAGCTAGCA AGTATCCTAA 60

TATCGTTAGA GCCATCTATC AGGAAAATAA ATGCCATGGC GGTGCGGTCA ATCGTGGCTT 120

GGTAGAGGCT TCTGGGCGCT ATTTTAAAGT AGTTGACAGT GATGACTGGG TGGATCCTCG 180

TGCCTACTTG AAAATTCTTG AAACTTGCAG GAACTTGAGA GCAAAGGTCA AGAGGTGGAT 240

GTCTTTG 247

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Gly Ser Gln Asp Gln Thr Gln Glu Ile Ala Glu Cys Leu Ala Ser
1 5 10 15

Lys Tyr Pro Asn Ile Val Arg Ala Ile Tyr Gln Glu Asn Lys Cys His
20 25 30

Gly Gly Ala Val Asn Arg Gly Leu Val Glu Ala Ser Gly Arg Tyr Phe
35 40 45

Lys Val Val Asp Ser Asp Asp Trp Val Asp Pro Arg Ala Tyr Leu Lys
50 55 60

Ile Leu Glu Thr Cys Arg Asn Leu Arg Ala Lys Val Lys Arg Trp Met
65 70 75 80

Ser Leu



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



TAGAGGCTTT 


GC C AAATGGT 


GGGAAGGGCA 


CGAGCGTCGA 


AAAGAGGAAC 




60 


ACAAGAAGAA 


AAAGCTCGCC 


AAAAGGC TGA 


GAAAGAGGCT 


AGATTAbAAt 


aapaapapap 


19 0 

J. £* KJ 


TGAAAAAGCC 


TTACTCGATT 


TGCCTCCTGT 


TGATATGGAA 


7v o<^PP mp AAA 

AC GGGTvjAAA 


rpmprnp SPirZi 
1 1L1 brAb-riVjA 


1 SO 

X CJ U 


GGAAGCTGTT 


CAAAATCTTC 


CACCTATTCC 


AGAAGAAAAG 


mpppfTPP^ 7\ /~i 
TGGG I VjCjAAv- 




240 


CCTGCCTCAA 


GCTGAACTTA 


AATTCCCTGA ACAGGAAGAT 


GA(_\L LAbA 1 bi 


appta A P A TP.T 


3 0 0 


TCAGGTCGAT 


TTTTCAGCCA 


AAGAAGCCCT 


TGAATACAAA 


n<T"T*PP 7\ 7i /™'P r P 

C\L IL-UAAbK-l 




3 60 


TGCACCAGAT 


AAACCAAAAG 


ATCAGTCTAA 


AGAGAAGAAA 


Al J. bx 1 AbrAbi 


AL2i ATS TP A A 


420 


AATC TTAGAA 


GCAACCTTTG 


CTAGCTTTGG 


TATTAAGGTA 


AC AG 1 1 bAAt 


PPPPPCZA A AT 


480 


TGGGCCATCA 


GTGACCAAGT 


ATGAAGTCAA 


GCCGGCTGTT 


GbTb 1 AAbibrbj 


TP AAPPPPAT 


540 


TTCCAATCTA TCAGATGACC TCGCTCTAGC CTTGGCTGCC AAAGATGTCC GGATTGAAGC 600

ACCAATCCCT GGGAAATCCC TAATCGGAAT TGAAGTGCCC AACTCCGATA TTGCCACTGT 660

ATCTTTCCGA GAACTATGGG AACAATCGCA AACGAAAGCA GAAAATTTCT TGGAAATTCC 720

TTTAGGGAAG GCTGTTAATG GAACCGCAAG AGCTTTTGAC CTTTCTAAAA TGCCCCACTT 780

GCTAGTTGCA GGTTCAACGG GTTCAGGGAA GTCAGTAGCA GTTAACGGCA TTATTGCTAG 840

CATTCTCATG AAGGCGAGAC CAGATCAAGT TAAATTTATG ATGGTCGATC CCAAGATGGT 900

TGAGTTATCT GTTTACAATG ATATTCCCCA CCTCTTGATT CCAGTCGTGA CCAATCCACG 960

CAAAGCCAGC AAGGCTCTGC AAAAGGTTGT GGATGAAATG GAAAACCGTT ATGAACTCTT 1020

TGCCAAGGTG GGAGTTCGGA ATATTGCAGG TTTTAATGCC AAGGTAGAAG AGTTCAATTC 1080

CCAGTCTGAG TACAAGCAAA TTCCGCTACC ATTCATTGTC GTGATTGTGG ATGAGTTGGC 1140

TGACCTCATG ATGGTGGCCA GCAAGGAAGT GGAAGATGCT ATCATCCGTC TTGGGCAGAA 1200

GGCGCGTGCT GCAGGTATCC ACATGATTCT TGCAACTCAG CGTCCATCTG TTGATGTCAT 1260

CTCTGGTTTG ATTAAGGCCA ATGTTCCATC TCGTGTAGCA TTTGCGGTTT CATCAGGAAC 1320

AGACTCCCGT ACGATTTTGG ATGAAAATGG AGCAGAAAAA CTTCTTGGTC GAGGAGACAT 1380

GCTCTTTAAA CCGATTGATG AAAATCATCC AGTTCGTCTC CAAGGCTCCT TTATCTCGGA 1440

TGACGATGTT GAGCGCATTG TGAACTTCAT CAAGACTCAG GCAGATGCAG ACTACGATGA 1500

GAGTTTTGAT CCAGGTGAGG TTTCTGAAAA TGAAGGAGAA TTTTCGGATG GAGATGCTGG 1560

TGGTGATCCG CTTTTTGAAG AAGCTAAGTC TTTGGTTATC GAAACACAGA AAGCCAGTGC 1620

GTCTATGATT CAGCGTCGTT TATCAGTTGG ATTTAACCGT GCGACCCGTC TCATGGAAGA 1680

ACTGGAGATA GCAGGTGTCA TCGGTCCAGC TGAAGGTACC AAACCTCGAA AAGTGTTACA 1740

ACAA 1744

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

Arg Gly Phe Ala Lys Trp Trp Glu Gly His Glu Arg Arg Lys Glu Glu
1 5 10 15

Arg Phe Val Lys Gln Glu Glu Lys Ala Arg Gln Lys Ala Glu Lys Glu
20 25 30

Ala Arg Leu Glu Gln Glu Glu Thr Glu Lys Ala Leu Leu Asp Leu Pro
35 40 45

Pro Val Asp Met Glu Thr Gly Glu Ile Leu Thr Glu Glu Ala Val Gln
50 55 60

Asn Leu Pro Pro Ile Pro Glu Glu Lys Trp Val Glu Pro Glu Ile Ile
65 70 75 80

Leu Pro Gln Ala Glu Leu Lys Phe Pro Glu Gln Glu Asp Asp Ser Asp
85 90 95

Asp Glu Asp Val Gln Val Asp Phe Ser Ala Lys Glu Ala Leu Glu Tyr
100 105 110

Lys Leu Pro Ser Leu Gln Leu Phe Ala Pro Asp Lys Pro Lys Asp Gln
115 120 125

Ser Lys Glu Lys Lys Ile Val Arg Glu Asn Ile Lys Ile Leu Glu Ala
130 135 140

Thr Phe Ala Ser Phe Gly Ile Lys Val Thr Val Glu Arg Ala Glu Ile
145 150 155 160

Gly Pro Ser Val Thr Lys Tyr Glu Val Lys Pro Ala Val Gly Val Arg
165 170 175

Val Asn Arg Ile Ser Asn Leu Ser Asp Asp Leu Ala Leu Ala Leu Ala
180 185 190

Ala Lys Asp Val Arg Ile Glu Ala Pro Ile Pro Gly Lys Ser Leu Ile
195 200 205

Gly Ile Glu Val Pro Asn Ser Asp Ile Ala Thr Val Ser Phe Arg Glu
210 215 220

Leu Trp Glu Gln Ser Gln Thr Lys Ala Glu Asn Phe Leu Glu Ile Pro
225 230 235 240

Leu Gly Lys Ala Val Asn Gly Thr Ala Arg Ala Phe Asp Leu Ser Lys
245 250 255

Met Pro His Leu Leu Val Ala Gly Ser Thr Gly Ser Gly Lys Ser Val
260 265 270

Ala Val Asn Gly Ile Ile Ala Ser Ile Leu Met Lys Ala Arg Pro Asp
275 280 285

Gln Val Lys Phe Met Met Val Asp Pro Lys Met Val Glu Leu Ser Val
290 295 300

Tyr Asn Asp Ile Pro His Leu Leu Ile Pro Val Val Thr Asn Pro Arg
305 310 315 320

Lys Ala Ser Lys Ala Leu Gln Lys Val Val Asp Glu Met Glu Asn Arg
325 330 335

Tyr Glu Leu Phe Ala Lys Val Gly Val Arg Asn Ile Ala Gly Phe Asn
340 345 350

Ala Lys Val Glu Glu Phe Asn Ser Gln Ser Glu Tyr Lys Gln Ile Pro
355 360 365

Leu Pro Phe Ile Val Val Ile Val Asp Glu Leu Ala Asp Leu Met Met
370 375 380

Val Ala Ser Lys Glu Val Glu Asp Ala Ile Ile Arg Leu Gly Gln Lys
385 390 395 400

Ala Arg Ala Ala Gly Ile His Met Ile Leu Ala Thr Gln Arg Pro Ser
405 410 415

Val Asp Val Ile Ser Gly Leu Ile Lys Ala Asn Val Pro Ser Arg Val
420 425 430

Ala Phe Ala Val Ser Ser Gly Thr Asp Ser Arg Thr Ile Leu Asp Glu
435 440 445

Asn Gly Ala Glu Lys Leu Leu Gly Arg Gly Asp Met Leu Phe Lys Pro
450 455 460

Ile Asp Glu Asn His Pro Val Arg Leu Gln Gly Ser Phe Ile Ser Asp
465 470 475 480

Asp Asp Val Glu Arg Ile Val Asn Phe Ile Lys Thr Gln Ala Asp Ala
485 490 495

Asp Tyr Asp Glu Ser Phe Asp Pro Gly Glu Val Ser Glu Asn Glu Gly
500 505 510

Glu Phe Ser Asp Gly Asp Ala Gly Gly Asp Pro Leu Phe Glu Glu Ala
515 520 525

Lys Ser Leu Val Ile Glu Thr Gln Lys Ala Ser Ala Ser Met Ile Gln
530 535 540

Arg Arg Leu Ser Val Gly Phe Asn Arg Ala Thr Arg Leu Met Glu Glu
545 550 555 560

Leu Glu Ile Ala Gly Val Ile Gly Pro Ala Glu Gly Thr Lys Pro Arg
565 570 575

Lys Val Leu Gln Gln
580

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



TCAAAAAGAG AAGGAAAACT TGGTTATTGC TGGGAAAATA GGTCCAGAAC CAGAAATTTT 60

GGCCAATATG TATAAGTTGC TGATTGAAGA AAATACCAGC ATGACTGCGA CTGTTAAACC 120

GAATTTTGGG AAGACAAGCT TCCTTTATGA AGCTCTGAAA AAAGGCGATA TTGACATCTA 180

TCCTGAATTT ACTGGTACGG TGACTGAAAG TTTGCTTCAA CCATCACCCA AGGTGAGTCA 240

TGAACCAGAA CAGGTTTATC AGGTGGCGCG TGATGGCATT GCTAAGCAGG ATCATCTAGC 300

CTATCTCAAA CCCATGTCTT ATCAAAACAC CTATGCTGTA GCTGTTCCGA AAAAGATTGC 360

TCAAGAATAT GGCTTGAAGA CCATTTCAGA CTTGAAAAAA GTGGAAGGGC AGTTGAAGGC 420

AGGTTTTACA CTCGAGTTTA ACGACCGTGA AGATGGAAAT AAGGGCTTGC AATCAATGTA 480

TGGTCTCAAT CTCAATGTAG CGACCATTGA GCCAGCCCTT CGCTATCAGG CTATTCAGTC 540

AGGGGATATT CAAATCACGG ATGCCTATTC GACTGATGCG GAATTGGAGC GTTATGATTT 600

ACAGGTCTTG GAAGATGACA AGCAACTCTT CCCACCTTAT CAAGGGGCTC CACTCATGAA 660

AGAAGCTCTT CTCAAGAAAC ACCCAGAGTT GGAAAGAGTT CTTAATACAT TGGCTGGTAA 720

GATTACAGAA AGCCAGATGA GCCAGCTCAA CTACCAAGTC GGTGTTGAAG GCAAGTCAGC 780

AAAGCAAGTA GCCAAGGAGT TTCTCCAAGA ACAAGGTTTG TTGAAGAAA 829



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



Gln Lys Glu Lys Glu Asn Leu Val Ile Ala Gly Lys Ile Gly Pro Glu
1 5 10 15

Pro Glu Ile Leu Ala Asn Met Tyr Lys Leu Leu Ile Glu Glu Asn Thr
20 25 30

Ser Met Thr Ala Thr Val Lys Pro Asn Phe Gly Lys Thr Ser Phe Leu
35 40 45

Tyr Glu Ala Leu Lys Lys Gly Asp Ile Asp Ile Tyr Pro Glu Phe Thr
50 55 60

Gly Thr Val Thr Glu Ser Leu Leu Gln Pro Ser Pro Lys Val Ser His
65 70 75 80

Glu Pro Glu Gln Val Tyr Gln Val Ala Arg Asp Gly Ile Ala Lys Gln
85 90 95

Asp His Leu Ala Tyr Leu Lys Pro Met Ser Tyr Gln Asn Thr Tyr Ala
100 105 110

Val Ala Val Pro Lys Lys Ile Ala Gln Glu Tyr Gly Leu Lys Thr Ile
115 120 125

Ser Asp Leu Lys Lys Val Glu Gly Gln Leu Lys Ala Gly Phe Thr Leu
130 135 140

Glu Phe Asn Asp Arg Glu Asp Gly Asn Lys Gly Leu Gln Ser Met Tyr
145 150 155 160

Gly Leu Asn Leu Asn Val Ala Thr Ile Glu Pro Ala Leu Arg Tyr Gln
165 170 175

Ala Ile Gln Ser Gly Asp Ile Gln Ile Thr Asp Ala Tyr Ser Thr Asp
180 185 190

Ala Glu Leu Glu Arg Tyr Asp Leu Gln Val Leu Glu Asp Asp Lys Gln
195 200 205

Leu Phe Pro Pro Tyr Gln Gly Ala Pro Leu Met Lys Glu Ala Leu Leu
210 215 220

Lys Lys His Pro Glu Leu Glu Arg Val Leu Asn Thr Leu Ala Gly Lys
225 230 235 240

Ile Thr Glu Ser Gln Met Ser Gln Leu Asn Tyr Gln Val Gly Val Glu
245 250 255

Gly Lys Ser Ala Lys Gln Val Ala Lys Glu Phe Leu Gln Glu Gln Gly
260 265 270

Leu Leu Lys Lys
275

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

ACGTTCTATT GAGGACCACT TTGATTCAAA CTTCGAATTG GAATATAACC TCAAAGAAAA 60

AGGGAAAACA GATCTTTTGA AGCTAGTTGA TAAAACAACT GACATGCGTC TGCATTTTAT 120

CCGCCAAACT CATCCACGCG GTCTCGGAGA TGCTGTTTTG CAAGCCAAGG CTTTCGTCGG 180

AAATGAACCT TTTGTCGTTA TGCTTGGTGA TGACTTGATG GATATCACAG ACGAAAAGGC 240

TGTTCCACTT ACCAAACAAC TCATGGATGA CTACGAGCGT ACCCACGCGT CTACTATCGC 300

TGTCATGCCA GTCCCTCATG ACGAAGTATC TGCTTACGGG GTTATTGCTC CGCAAGGCGA 360

AGGAAAAGAT GGTCTTTACA GTGTTGAAAC CTTTGTTGAA AAACCAGCTC CAGAGGACGC 420

TCCTAGCGAC CTTGCTATTA TCGGACGCTA CCTCCTCACG CCTGAAATTT TTGAGATTCT 480

CGAAAAGCAA GCTCCAGGTG CAGGAAATGA AATTCAGCTG ACAGATGCAA TCGACACCCT 540

CAATAAAACA CAACGTGTAT TTGCTCGTGA GTTCAAAGGG GCTCGTTACG ATGTCGGAGA 600

CAAGTTTGGC TTCATGAAAA CATCCATCGA CTACGCCCTC AAACACCCAC AAGTCAAAGA 660

TGATTTGAAG AATTACCTCA TCCAACTTGG AAAAGAATTG ACTGAGAAGG AA 712
(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Arg Ser Ile Glu Asp His Phe Asp Ser Asn Phe Glu Leu Glu Tyr Asn
1 5 10 15

Leu Lys Glu Lys Gly Lys Thr Asp Leu Leu Lys Leu Val Asp Lys Thr
20 25 30

Thr Asp Met Arg Leu His Phe Ile Arg Gln Thr His Pro Arg Gly Leu
35 40 45

Gly Asp Ala Val Leu Gln Ala Lys Ala Phe Val Gly Asn Glu Pro Phe
50 55 60

Val Val Met Leu Gly Asp Asp Leu Met Asp Ile Thr Asp Glu Lys Ala
65 70 75 80

Val Pro Leu Thr Lys Gln Leu Met Asp Asp Tyr Glu Arg Thr His Ala
85 90 95

Ser Thr Ile Ala Val Met Pro Val Pro His Asp Glu Val Ser Ala Tyr
100 105 110

Gly Val Ile Ala Pro Gln Gly Glu Gly Lys Asp Gly Leu Tyr Ser Val
115 120 125

Glu Thr Phe Val Glu Lys Pro Ala Pro Glu Asp Ala Pro Ser Asp Leu
130 135 140

Ala Ile Ile Gly Arg Tyr Leu Leu Thr Pro Glu Ile Phe Glu Ile Leu
145 150 155 160

Glu Lys Gln Ala Pro Gly Ala Gly Asn Glu Ile Gln Leu Thr Asp Ala
165 170 175

Ile Asp Thr Leu Asn Lys Thr Gln Arg Val Phe Ala Arg Glu Phe Lys
180 185 190

Gly Ala Arg Tyr Asp Val Gly Asp Lys Phe Gly Phe Met Lys Thr Ser
195 200 205

Ile Asp Tyr Ala Leu Lys His Pro Gln Val Lys Asp Asp Leu Lys Asn
210 215 220

Tyr Leu Ile Gln Leu Gly Lys Glu Leu Thr Glu Lys Glu
225 230 235

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

CGCTCAAAAT ACCAGAGGTG TTCAGCTAAT CGAGCACGTT TCTCCTCAAA TGTTGAAAGC 60

CCAATTGGAG AGTGTCTTTT CTGATATTCC ACCTCAGGCT GTAAAAACTG GAATGTTGGC 120

TACTACTGAA ATCATGGAAA TCATCCAACC CTATCTTAAA AAACTGGATT GTCCCTATGT 180

CCTTGATCCT GTTATGGTTG CTACAAGTGG AGATGCCTTG ATTGACTCAA ATGCTAGAGA 240

CTATCTCAAA ACAAACTTAC TACCTCTAGC AACTATTATT ACGCCAAATC TTCCTGAAGC 300

AGAAGAGATT GTTGGTTTTT CAATCCATGA CCCCGAAGAC ATGCAGCGTG CTGGTCGCCT 360

GATTTTAAAA GAATTTGGTC CTCAGTCTGT GGTTATCAAA GGCGGACATC TCAAAGGTGG 420

TGCTAAAGAT TTCCTCTTTA CCAAGAATGA ACAATTTGTC TGGGAAAGCC CACGAATTCA 480

AACCTGTCAC ACCCATGGTA CT 502
(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Ala Gln Asn Thr Arg Gly Val Gln Leu Ile Glu His Val Ser Pro Gln
1 5 10 15

Met Leu Lys Ala Gln Leu Glu Ser Val Phe Ser Asp Ile Pro Pro Gln
20 25 30

Ala Val Lys Thr Gly Met Leu Ala Thr Thr Glu Ile Met Glu Ile Ile
35 40 45

Gln Pro Tyr Leu Lys Lys Leu Asp Cys Pro Tyr Val Leu Asp Pro Val
50 55 60

Met Val Ala Thr Ser Gly Asp Ala Leu Ile Asp Ser Asn Ala Arg Asp
65 70 75 80

Tyr Leu Lys Thr Asn Leu Leu Pro Leu Ala Thr Ile Ile Thr Pro Asn
85 90 95

Leu Pro Glu Ala Glu Glu Ile Val Gly Phe Ser Ile His Asp Pro Glu
100 105 110

Asp Met Gln Arg Ala Gly Arg Leu Ile Leu Lys Glu Phe Gly Pro Gln
115 120 125

Ser Val Val Ile Lys Gly Gly His Leu Lys Gly Gly Ala Lys Asp Phe
130 135 140

Leu Phe Thr Lys Asn Glu Gln Phe Val Trp Glu Ser Pro Arg Ile Gln
145 150 155 160

Thr Cys His Thr His Gly Thr
165

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 805 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

AATTGTACAA TTAGAAAAAG ATAGCAAATC AGACAAAGAA CAAGTTGATA AACTATTTGA 60

ATCATTTGAT GCATCTTCAG ATGAATCTAT TTCTAAATTA AAAGAACTAT CTGAAACTTC 120

ACTTAAAACC GATGCAGGTA AAGACTATCT TAATAACAAA GTCAAAGAAT CATCTAAAGC 180

AATTGTAGAT TTTCATTTGC AAAAAGGTTT GGCTTATGAT GTTAAAGATT CAGATGACAA 240

ATTTAAAGAT AAAGCAACTC TTGAAACAAA TGTAAAAGAA ATTACAAAAC AAATTGATTT 300

TATCAAAAAA GTTGATGAAA CTTTTAAACA AGAGAATTTG GAAGAAACTC TTAAATCTCT 360

AAATGATCTT GTTGATAAAT ATCAAAAACA AATCGAACTT TTGAAGAAAG AAGAAGAAAA 420

AGCTGCTGAA AAAGCTGCTG AAAAAGCAAA GGAATCTTCT AGTCAAAGTA ATTCTTCTGG 480

TAGTGCTTCT AATGAGTCTT ATAATGGATC TTCCAATTCA AATGTAGATT ATAGTTCATC 540

TGAACAAACT AATGGATATT CAAATAATTA TGGCGGTCAA GATTATTCTG GTTCAGGAGA 600

TAGTTCAACA AATGGTGGAT CATCAGAACA ATATTCATCT AGCAATTCAA ACAGCGGAGC 660

AAATAATGTC TACAGATATA AAGGCACTGG TGCTGACGGC TATCAAAGAT ACTACTACAA 720

AGATCATAAT AATGGAGATG TGTATGATGA CGATGGAAAT TACCTTGGGA ACTTTGGTGG 780

CGGCATTGCA GAACCTAGTC AACGC 805
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

Ile Val Gln Leu Glu Lys Asp Ser Lys Ser Asp Lys Glu Gln Val Asp
1 5 10 15

Lys Leu Phe Glu Ser Phe Asp Ala Ser Ser Asp Glu Ser Ile Ser Lys
20 25 30

Leu Lys Glu Leu Ser Glu Thr Ser Leu Lys Thr Asp Ala Gly Lys Asp
35 40 45

Tyr Leu Asn Asn Lys Val Lys Glu Ser Ser Lys Ala Ile Val Asp Phe
50 55 60

His Leu Gln Lys Gly Leu Ala Tyr Asp Val Lys Asp Ser Asp Asp Lys
65 70 75 80

Phe Lys Asp Lys Ala Thr Leu Glu Thr Asn Val Lys Glu Ile Thr Lys
85 90 95

Gln Ile Asp Phe Ile Lys Lys Val Asp Glu Thr Phe Lys Gln Glu Asn
100 105 110

Leu Glu Glu Thr Leu Lys Ser Leu Asn Asp Leu Val Asp Lys Tyr Gln
115 120 125

Lys Gln Ile Glu Leu Leu Lys Lys Glu Glu Glu Lys Ala Ala Glu Lys
130 135 140

Ala Ala Glu Lys Ala Lys Glu Ser Ser Ser Gln Ser Asn Ser Ser Gly
145 150 155 160

Ser Ala Ser Asn Glu Ser Tyr Asn Gly Ser Ser Asn Ser Asn Val Asp
165 170 175

Tyr Ser Ser Ser Glu Gln Thr Asn Gly Tyr Ser Asn Asn Tyr Gly Gly
180 185 190

Gln Asp Tyr Ser Gly Ser Gly Asp Ser Ser Thr Asn Gly Gly Ser Ser
195 200 205

Glu Gln Tyr Ser Ser Ser Asn Ser Asn Ser Gly Ala Asn Asn Val Tyr
210 215 220

Arg Tyr Lys Gly Thr Gly Ala Asp Gly Tyr Gln Arg Tyr Tyr Tyr Lys
225 230 235 240

Asp His Asn Asn Gly Asp Val Tyr Asp Asp Asp Gly Asn Tyr Leu Gly
245 250 255

Asn Phe Gly Gly Gly Ile Ala Glu Pro Ser Gln Arg
260 265

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

TCTGACCAAG CAAAAAGAAG CAGTCAATGA CAAAGGAAAA GCAGCTGTTG TTAAGGTGGT 60

GGAAAGCCAG GCAGAACTTT ATAGCTTAGA AAAGAATGAA GATGCTAGCC TAAGAAAGTT 120

ACAAGCAGAT GGACGCATCA CGGAAGAACA GGCTAAAGCT TATAAAGAAT ACAATGATAA 180

AAATGGAGGA GCAAATCGTA AAGTCAATGA T 211

(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 

Leu Thr Lys Gln Lys Glu Ala Val Asn Asp Lys Gly Lys Ala Ala Val
1 5 10 15

Val Lys Val Val Glu Ser Gln Ala Glu Leu Tyr Ser Leu Glu Lys Asn
20 25 30

Glu Asp Ala Ser Leu Arg Lys Leu Gln Ala Asp Gly Arg Ile Thr Glu
35 40 45

Glu Gln Ala Lys Ala Tyr Lys Glu Tyr Asn Asp Lys Asn Gly Gly Ala
50 55 60

Asn Arg Lys Val Asn Asp
65 70

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC GGTAGAGGAA CAGATTTTCT TTATGGAGTT 60

TGAAGAACTC TATCGGGAAA CCCAAAAACG CAGTGTAGCC AGTCAGCAAA AGACTAGTCT 120

GAACTTAGAT GGGCAGACGC TTAGCAATGG CAGTCAAAAG TTGCCAGTCC CTAAAGGAAT 180

TCAGGCCCCA TCAGGCCAAA GTATTACATT TGACCGAGCT GGGGGCAATT CGTCCCTGGC 240

TAAGGTTGAA TTTCAGACCA GTAAAGGAGC GATTCGCTAT CAATTATATC TAGGAAATGG 300

AAAAATTAAA CGCATTAAGG AAACAAAAAA T 331
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Ser Gly Ser Val Gln Ser Thr Phe Ser Ala Val Glu Glu Gln Ile Phe
1 5 10 15

Phe Met Glu Phe Glu Glu Leu Tyr Arg Glu Thr Gln Lys Arg Ser Val
20 25 30

Ala Ser Gln Gln Lys Thr Ser Leu Asn Leu Asp Gly Gln Thr Leu Ser
35 40 45

Asn Gly Ser Gln Lys Leu Pro Val Pro Lys Gly Ile Gln Ala Pro Ser
50 55 60

Gly Gln Ser Ile Thr Phe Asp Arg Ala Gly Gly Asn Ser Ser Leu Ala
65 70 75 80

Lys Val Glu Phe Gln Thr Ser Lys Gly Ala Ile Arg Tyr Gln Leu Tyr
85 90 95

Leu Gly Asn Gly Lys Ile Lys Arg Ile Lys Glu Thr Lys Asn
100 105 110

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

GGGACAAATT CAAAAAAATA GGCAAGAGGA AGCAAAAATC TTGCAAAAGG AAGAAGTCTT 60

GAGGGTAGCT AAGATGGCCC TGCAGACGGG GCAAAATCAG GTAAGCATCA ACGGAGTTGA 120

GATTCAGGTA TTTTCTAGTG AAAAAGGATT GGAGGTCTAC CATGGTTCAG AACAGTTGTT 180

GGCAATCAAA GAGCCA 196
(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

Gly Gln Ile Gln Lys Asn Arg Gln Glu Glu Ala Lys Ile Leu Gln Lys
1 5 10 15

Glu Glu Val Leu Arg Val Ala Lys Met Ala Leu Gln Thr Gly Gln Asn
20 25 30

Gln Val Ser Ile Asn Gly Val Glu Ile Gln Val Phe Ser Ser Glu Lys
35 40 45

Gly Leu Glu Val Tyr His Gly Ser Glu Gln Leu Leu Ala Ile Lys Glu
50 55 60

Pro
65

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TCGCTACCAG CAACAAAGCG AGCAAAAGGA GTGGCTCTTG TTTGTGGACC AACTTGAGGT 60

AGAATTAGAC CGTTCGCAGT TCGAAAAAGT AGAAGGCAAT CGCCTATACA TGAAGCAAGA 120

TGGCAAGGAC ATCGCCATCG GTAAGTCAAA GTCAGATGAT TTCCGTAAAA CGAATGCTCG 180

TGGTCGAGGT TATCAGCCTA TGGTTTATGG ACTCAAATCT GTACGGATTA CAGAGGACAA 240

TCAACTGGTT CGCTTTCATT TCCAGTTCCA AAAAGGCTTA GAAAGGGAGT TCATCTATCG 300

TGTGGAAAAA GAAAAAAGT 319

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Arg Tyr Gln Gln Gln Ser Glu Gln Lys Glu Trp Leu Leu Phe Val Asp
1 5 10 15

Gln Leu Glu Val Glu Leu Asp Arg Ser Gln Phe Glu Lys Val Glu Gly
20 25 30

Asn Arg Leu Tyr Met Lys Gln Asp Gly Lys Asp Ile Ala Ile Gly Lys
35 40 45

Ser Lys Ser Asp Asp Phe Arg Lys Thr Asn Ala Arg Gly Arg Gly Tyr
50 55 60

Gln Pro Met Val Tyr Gly Leu Lys Ser Val Arg Ile Thr Glu Asp Asn
65 70 75 80

Gln Leu Val Arg Phe His Phe Gln Phe Gln Lys Gly Leu Glu Arg Glu
85 90 95

Phe Ile Tyr Arg Val Glu Lys Glu Lys Ser
100 105

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

GAACCGACAA GTCGCCCACT ATCAAGACTA TGCTTTGAAT AAAGAAAAAT TGGTTGCTTT 60

TGCTATGGCT AAACGAACCA AAGATAAGGT TGAGCAAGAA AGTGGGGAAC AGTTTTTTAA 120

TCTAGGTCAG GTAAGCTATC AAAACAAGAA AACTGGCTTA GTGACGAGGG TTCGTACGGA 180

TAAGAGCCAA TATGAGTTTC TGTTTCCTTC AGTCAAAATC AAAGAAGAGA AAAGAGATAA 240

AAAGGAAGAG GTAGCGACCG ATTCAAGCGA AAAAGTGGAG AAGAAAAAAT CAGAAGAGAA 300

GCCTGAAAAG AAAGAGAATT CA 322
(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

Asn Arg Gln Val Ala His Tyr Gln Asp Tyr Ala Leu Asn Lys Glu Lys
1 5 10 15

Leu Val Ala Phe Ala Met Ala Lys Arg Thr Lys Asp Lys Val Glu Gln
20 25 30

Glu Ser Gly Glu Gln Phe Phe Asn Leu Gly Gln Val Ser Tyr Gln Asn
35 40 45

Lys Lys Thr Gly Leu Val Thr Arg Val Arg Thr Asp Lys Ser Gln Tyr
50 55 60

Glu Phe Leu Phe Pro Ser Val Lys Ile Lys Glu Glu Lys Arg Asp Lys
65 70 75 80

Lys Glu Glu Val Ala Thr Asp Ser Ser Glu Lys Val Glu Lys Lys Lys
85 90 95

Ser Glu Glu Lys Pro Glu Lys Lys Glu Asn Ser
100 105

(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 784 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GGTTGTCGGC TGGCAATATA TCCCGTTTCC ATCTAAAGGT AGTACAATTG GTCCTTACCC 60

AAATGGTATC AGATTAGAAG GTTTTCCAAA GTCAGAGTGG TACTACTTCG ATAAAAATGG 120

AGTGCTACAA GAGTTTGTTG GTTGGAAAAC ATTAGAGATT AAAACTAAAG ACAGTGTTGG 180

AAGAAAGTAC GGGGAAAAAC GTGAAGATTC AGAAGATAAA GAAGAGAAGC GTTATTATAC 240

GAACTATTAC TTTAATCAAA ATCATTCTTT AGAGACAGGT TGGCTTTATG ATCAGTCTAA 300

CTGGTATTAT CTAGCTAAGA CGGAAATTAA TGGAGAAAAC TACCTTGGTG GTGAAAGACG 360

TGCGGGGTGG ATAAACGATG ATTCGACTTG GTACTACCTA GATCCAACAA CTGGTATTAT 420

GCAAACAGGT TGGCAATATC TAGGTAATAA GTGGTACTAC CTCCGTTCCT CAGGAGCAAT 480

GGCCACTGGC TGGTATCAGG AAGGTACCAC TTGGTATTAT TTAGACCACC CAAATGGCGA 540

TATGAAAACA GGTTGGCAAA ACCTTGGGAA CAAATGGTAC TATCTCCGTT CATCAGGAGC 600

TATGGCAACT GGTTGGTATC AAGATGGTTC AACTTGGTAC TACCTAAATG CAGGTAATGG 660

AGACATGAAG ACAGGTTGGT TCCAGGTCAA TGGCAACTGG TACTATGCTT ATAGCTCAGG 720

TGCTTTGGCA GTGAATACGA CCGTAGATGG CTATTCTGTC AACTATAATG GCGAATGGGT 780

TCGG 784

(2) INFORMATION FOR SEQ ID NO:152: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 261 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Val Val Gly Trp Gln Tyr Ile Pro Phe Pro Ser Lys Gly Ser Thr Ile
1 5 10 15

Gly Pro Tyr Pro Asn Gly Ile Arg Leu Glu Gly Phe Pro Lys Ser Glu
20 25 30

Trp Tyr Tyr Phe Asp Lys Asn Gly Val Leu Gln Glu Phe Val Gly Trp
35 40 45

Lys Thr Leu Glu Ile Lys Thr Lys Asp Ser Val Gly Arg Lys Tyr Gly
50 55 60

Glu Lys Arg Glu Asp Ser Glu Asp Lys Glu Glu Lys Arg Tyr Tyr Thr
65 70 75 80

Asn Tyr Tyr Phe Asn Gln Asn His Ser Leu Glu Thr Gly Trp Leu Tyr
85 90 95

Asp Gln Ser Asn Trp Tyr Tyr Leu Ala Lys Thr Glu Ile Asn Gly Glu
100 105 110

Asn Tyr Leu Gly Gly Glu Arg Arg Ala Gly Trp Ile Asn Asp Asp Ser
115 120 125

Thr Trp Tyr Tyr Leu Asp Pro Thr Thr Gly Ile Met Gln Thr Gly Trp
130 135 140

Gln Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met
145 150 155 160

Ala Thr Gly Trp Tyr Gln Glu Gly Thr Thr Trp Tyr Tyr Leu Asp His
165 170 175

Pro Asn Gly Asp Met Lys Thr Gly Trp Gln Asn Leu Gly Asn Lys Trp
180 185 190

Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gln Asp
195 200 205

Gly Ser Thr Trp Tyr Tyr Leu Asn Ala Gly Asn Gly Asp Met Lys Thr
210 215 220

Gly Trp Phe Gln Val Asn Gly Asn Trp Tyr Tyr Ala Tyr Ser Ser Gly
225 230 235 240

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Ser Val Asn Tyr Asn
245 250 255

Gly Glu Trp Val Arg
260

(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1708 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



GGCCAAATCA GAATGGGTAG AAGACAAGGG AGCCTTTTAT TATCTTGACC AAGATGGAAA 60

GATGAAAAGA AATGCTTGGG TAGGAACTTC CTATGTTGGT GCAACAGGTG CCAAAGTAAT 120

AGAAGACTGG GTCTATGATT CTCAATACGA TGCTTGGTTT TATATCAAAG CAGATGGACA 180

GCACGCAGAG AAAGAATGGC TCCAAATTAA AGGGAAGGAC TATTATTTCA AATCCGGTGG 240

TTATCTACTG ACAAGTCAGT GGATTAATCA AGCTTATGTG AATGCTAGTG GTGCCAAAGT 300

ACAGCAAGGT TGGCTTTTTG ACAAACAATA CCAATCTTGG TTTTACATCA AAGAAAATGG 360

AAACTATGCT GATAAAGAAT GGATTTTCGA GAATGGTCAC TATTATTATC TAAAATCCGG 420

TGGCTACATG GCAGCCAATG AATGGATTTG GGATAAGGAA TCTTGGTTTT ATCTCAAATT 480

TGATGGGAAA ATGGCTGAAA AAGAATGGGT CTACGATTCT CATAGTCAAG CTTGGTACTA 540

CTTCAAATCC GGTGGTTACA TGACAGCCAA TGAATGGATT TGGGATAAGG AATCTTGGTT 600

TTATCTCAAA TCTGATGGGA AAATAGCTGA AAAAGAATGG GTCTACGATT CTCATAGTCA 660

AGCTTGGTAC TACTTCAAAT CCGGTGGTTA CATGACAGCC AATGAATGGA TTTGGGATAA 720

GGAATCTTGG TTTTACCTCA AATCTGATGG GAAAATAGCT GAAAAAGAAT GGGTCTACGA 780

TTCTCATAGT CAAGCTTGGT ACTACTTCAA ATCTGGTGGC TACATGGCGA AAAATGAGAC 840

AGTAGATGGT TATCAGCTTG GAAGCGATGG TAAATGGCTT GGAGGAAAAA CTACAAATGA 900

AAATGCTGCT TACTATCAAG TAGTGCCTGT TACAGCCAAT GTTTATGATT CAGATGGTGA 960

AAAGCTTTCC TATATATCGC AAGGTAGTGT CGTATGGCTA GATAAGGATA GAAAAAGTGA 1020

TGACAAGCGC TTGGCTATTA CTATTTCTGG TTTGTCAGGC TATATGAAAA CAGAAGATTT 1080

ACAAGCGCTA GATGCTAGTA AGGACTTTAT CCCTTATTAT GAGAGTGATG GCCACCGTTT 1140

TTATCACTAT GTGGCTCAGA ATGCTAGTAT CCCAGTAGCT TCTCATCTTT CTGATATGGA 1200

AGTAGGCAAG AAATATTATT CGGCAGATGG CCTGCATTTT GATGGTTTTA AGCTTGAGAA 1260

TCCCTTCCTT TTCAAAGATT TAACAGAGGC TACAAACTAC AGTGCTGAAG AATTGGATAA 1320

GGTATTTAGT TTGCTAAACA TTAACAATAG CCTTTTGGAG AACAAGGGCG CTACTTTTAA 1380

GGAAGCCGAA GAACATTACC ATATCAATGC TCTTTATCTC CTTGCCCATA GTGCCCTAGA 1440

AAGTAACTGG GGAAGAAGTA AAATTGCCAA AGATAAGAAT AATTTCTTTG GCATTACAGC 1500

CTATGATACG ACCCCTTACC TTTCTGCTAA GACATTTGAT GATGTGGATA AGGGAATTTT 1560

AGGTGCAACC AAGTGGATTA AGGAAAATTA TATCGATAGG GGAAGAACTT TCCTTGGAAA 1620

CAAGGCTTCT GGTATGAATG TGGAATATGC TTCAGACCCT TATTGGGGCG AAAAAATTGC 1680

TAGTGTGATG ATGAAAATCA ATGAGAAG 1708
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Ala Lys Ser Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp
1 5 10 15

Gln Asp Gly Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val
20 25 30

Gly Ala Thr Gly Ala Lys Val Ile Glu Asp Trp Val Tyr Asp Ser Gln
35 40 45

Tyr Asp Ala Trp Phe Tyr Ile Lys Ala Asp Gly Gln His Ala Glu Lys
50 55 60

Glu Trp Leu Gln Ile Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly
65 70 75 80

Tyr Leu Leu Thr Ser Gln Trp Ile Asn Gln Ala Tyr Val Asn Ala Ser
85 90 95

Gly Ala Lys Val Gln Gln Gly Trp Leu Phe Asp Lys Gln Tyr Gln Ser
100 105 110

Trp Phe Tyr Ile Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp Ile
115 120 125

Phe Glu Asn Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr Met Ala
130 135 140

Ala Asn Glu Trp Ile Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe
145 150 155 160

Asp Gly Lys Met Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gln
165 170 175

Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp
180 185 190

Ile Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys Ile
195 200 205

Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gln Ala Trp Tyr Tyr
210 215 220

Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp Ile Trp Asp Lys
225 230 235 240

Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys Ile Ala Glu Lys Glu
245 250 255

Trp Val Tyr Asp Ser His Ser Gln Ala Trp Tyr Tyr Phe Lys Ser Gly
260 265 270

Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gly Tyr Gln Leu Gly Ser
275 280 285

Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala Tyr
290 295 300

Tyr Gln Val Val Pro Val Thr Ala Asn Val Tyr Asp Ser Asp Gly Glu
305 310 315 320

Lys Leu Ser Tyr Ile Ser Gln Gly Ser Val Val Trp Leu Asp Lys Asp
325 330 335

Arg Lys Ser Asp Asp Lys Arg Leu Ala Ile Thr Ile Ser Gly Leu Ser
340 345 350

Gly Tyr Met Lys Thr Glu Asp Leu Gln Ala Leu Asp Ala Ser Lys Asp
355 360 365

Phe Ile Pro Tyr Tyr Glu Ser Asp Gly His Arg Phe Tyr His Tyr Val
370 375 380

Ala Gln Asn Ala Ser Ile Pro Val Ala Ser His Leu Ser Asp Met Glu
385 390 395 400

Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly Phe
405 410 415

Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala Thr Asn
420 425 430

Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn Ile Asn
435 440 445

Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu Glu
450 455 460

His Tyr His Ile Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu Glu
465 470 475 480

Ser Asn Trp Gly Arg Ser Lys Ile Ala Lys Asp Lys Asn Asn Phe Phe
485 490 495

Gly Ile Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr Phe
500 505 510

Asp Asp Val Asp Lys Gly Ile Leu Gly Ala Thr Lys Trp Ile Lys Glu
515 520 525

Asn Tyr Ile Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser Gly
530 535 540

Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys Ile Ala
545 550 555 560

Ser Val Met Met Lys Ile Asn Glu Lys
565

(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 946 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

ATTTGCAGAT GATTCTGAAG GATGGCAGTT TGTCCAAGAA AATGGTAGAA CCTACTACAA 60

AAAGGGGGAT CTAAAAGAAA CCTACTGGAG AGTGATAGAT GGGAAGTACT ATTATTTTGA 120

TCCTTTATCC GGAGAGATGG TTGTCGGCTG GCAATATATA CCTGCTCCAC ACAAGGGGGT 180

TACGATTGGT CCTTCTCCAA GAATAGAGAT TGCTCTTAGA CCAGATTGGT TTTATTTTGG 240

TCAAGATGGT GTATTACAAG AATTTGTTGG CAAGCAAGTT TTAGAAGCAA AAACTGCTAC 300

GAATACCAAC AAACATCATG GGGAAGAATA TGATAGCCAA GCAGAGAAAC GAGTCTATTA 360

TTTTGAAGAT CAGCGTAGTT ATCATACTTT AAAAACTGGT TGGATTTATG AAGAGGGTCA 420

TTGGTATTAT TTACAGAAGG ATGGTGGCTT TGATTCGCGC ATCAACAGAT TGACGGTTGG 480

AGAGCTAGCA CGTGGTTGGG TTAAGGATTA CCCTCTTACG TATGATGAAG AGAAGCTAAA 540

AGCAGCTCCA TGGTACTATC TAAATCCAGC AACTGGCATT ATGCAAACAG GTTGGCAATA 600

TCTAGGTAAT AGATGGTACT ACCTCCATTC GTCAGGAGCT ATGGCAACTG GCTGGTATAA 660

GGAAGGCTCA ACTTGGTACT ATCTAGATGC TGAAAATGGT GATATGAGAA CTGGCTGGCA 720

AAACCTTGGG AACAAATGGT ACTATCTCCG TTCATCAGGA GCTATGGCAA CTGGTTGGTA 780

TCAGGAAAGT TCGACTTGGT ACTATCTAAA TGCAAGTAAT GGAGATATGA AAACAGGCTG 840

GTTCCAAGTC AATGGTAACT GGTACTATGC CTATGATTCA GGTGCTTTAG CTGTTAATAC 900

CACAGTAGGT GGTTACTACT TAAACTATAA TGGTGAATGG GTTAAG 946
(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 316 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Val Phe Ala Asp Asp Ser Glu Gly Trp Gln Phe Val Gln Glu Asn Gly
1 5 10 15

Arg Thr Tyr Tyr Lys Lys Gly Asp Leu Lys Glu Thr Tyr Trp Arg Val
20 25 30

Ile Asp Gly Lys Tyr Tyr Tyr Phe Asp Pro Leu Ser Gly Glu Met Val
35 40 45

Val Gly Trp Gln Tyr Ile Pro Ala Pro His Lys Gly Val Thr Ile Gly
50 55 60

Pro Ser Pro Arg Ile Glu Ile Ala Leu Arg Pro Asp Trp Phe Tyr Phe
65 70 75 80

Gly Gln Asp Gly Val Leu Gln Glu Phe Val Gly Lys Gln Val Leu Glu
85 90 95

Ala Lys Thr Ala Thr Asn Thr Asn Lys His His Gly Glu Glu Tyr Asp
100 105 110

Ser Gln Ala Glu Lys Arg Val Tyr Tyr Phe Glu Asp Gln Arg Ser Tyr
115 120 125

His Thr Leu Lys Thr Gly Trp Ile Tyr Glu Glu Gly His Trp Tyr Tyr
130 135 140

Leu Gln Lys Asp Gly Gly Phe Asp Ser Arg Ile Asn Arg Leu Thr Val
145 150 155 160

Gly Glu Leu Ala Arg Gly Trp Val Lys Asp Tyr Pro Leu Thr Tyr Asp
165 170 175

Glu Glu Lys Leu Lys Ala Ala Pro Trp Tyr Tyr Leu Asn Pro Ala Thr
180 185 190

Gly Ile Met Gln Thr Gly Trp Gln Tyr Leu Gly Asn Arg Trp Tyr Tyr
195 200 205

Leu His Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Lys Glu Gly Ser
210 215 220

Thr Trp Tyr Tyr Leu Asp Ala Glu Asn Gly Asp Met Arg Thr Gly Trp
225 230 235 240

Gln Asn Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met
245 250 255

Ala Thr Gly Trp Tyr Gln Glu Ser Ser Thr Trp Tyr Tyr Leu Asn Ala
260 265 270

Ser Asn Gly Asp Met Lys Thr Gly Trp Phe Gln Val Asn Gly Asn Trp
275 280 285

Tyr Tyr Ala Tyr Asp Ser Gly Ala Leu Ala Val Asn Thr Thr Val Gly
290 295 300

Gly Tyr Tyr Leu Asn Tyr Asn Gly Glu Trp Val Lys
305 310 315

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

TGTCGCTGCA AATGAAACTG AAGTAGCAAA AACTTCGCAG GATACAACGA CAGCTTCAAG 60

TAGTTCAGAG CAAAATCAGT CTTCTAATAA AACGCAAACG AGCGCAGAAG TACAGACTAA 120

TGCTGCTGCC CACTGGGATG GGGATTATTA TGTAAAGGAT GATGGTTCTA AAGCTCAAAG 180

TGAATGGATT TTTGACAACT ACTATAAGGC TTGGTTTTAT ATTAATTCAG ATGGTCGTTA 240

CTCGCAGAAT GAATGGCATG GAAATTACTA CCTGAAATCA GGTGGATATA TGGCCCAAAA 300

CGAGTGGATC TATGACAGTA ATTACAAGAG TTGGTTTTAT CTCAAGTCAG ATGGGGCTTA 360

TGCTCATCAA GAATGGCAAT TGATTGGAAA TAAGTGGTAC TACTTCAAGA AGTGGGGTTA 420

CATGGCTAAA AGCCAATGGC AAGGAAGTTA TTTCTTGAAT GGTCAAGGAG CTATGATGCA 480

AAATGAATGG CTSCTATGAT CCAGCCTATT CTGCTTATTT TTATCTAAAA TCCGATGGAA 540

CTTATGCTAA CCAAGAGTGG CAAAAAGTGG GCGGCAAATG GTACTATTTC AAGAAGTGGG 600

GCTATATGGC TCGGAATGAG TGGCAAGGCA ACTACTATTT GACTGGAAGT GGTGCCATGG 660

CGACTGACGA AGTGATTATG GATGGTACTC GCTATATCTT TGCGGCCTCT GGTGAGCTCA 720

AAGAAAAAAA AGATTTGAAT GTCGGCTGGG TTCACAGAGA TGGTAAGCGC TATTTCTTTA 780

ATAATAGAGA AGAACAAGTG GGAACCGAAC ATGCTAAGAA AGTCATTGAT ATTAGTGAGC 840

ACAATGGTCG TATCAATGAT TGGAAAAAGG TTATTGATGA GAACGAAGTG GATGGTGTCA 900

TTGTTCGTCT AGGTTATAGC GGTAAAGAAG ACAAGGAATT GGCGCATAAC ATTAAGGAGT 960

TAAACCGTCT GGGAATTCCT TATGGTGTCT ATCTCTATAC CTATGCTGAA AATGAGACCG 1020

ATGCTGAGAG TGACGCTAAA CAGACCATTG AACTTATAAA GAAATACAAT ATGAACCTGT 1080

CTTACCCTAT CTATTATGAT GTTGAGAATT GGGAATATGT AAATAAGAGC AAGAGAGCTC 1140

CAAGTGATAC AGGCACTTGG GTTAAAATCA TCAACAAGTA CATGGACACG ATGAAGCAGG 1200

CGGGTTATCA AAATGTGTAT GTCTATAGCT ATCGTAGTTT ATTACAGACG CGTTTAAAAC 1260

ACCCAGATAT TTTAAAACAT GTAAACTGGG TAGCGGCCTA TACGAATGCT TTAGAATGGG 1320

AAAACCCTCA TTATTCAGGA AAAAAAGGTT GGCAATATAC CTCTTCTGAA TACATGAAAG 1380

GAATCCAAGG GCGCGTAGAT GTCAGCGTTT GGTAT 1415



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

Val Ala Ala Asn Glu Thr Glu Val Ala Lys Thr Ser Gln Asp Thr Thr
1 5 10 15

Thr Ala Ser Ser Ser Ser Glu Gln Asn Gln Ser Ser Asn Lys Thr Gln
20 25 30

Thr Ser Ala Glu Val Gln Thr Asn Ala Ala Ala His Trp Asp Gly Asp
35 40 45

Tyr Tyr Val Lys Asp Asp Gly Ser Lys Ala Gln Ser Glu Trp Ile Phe
50 55 60

Asp Asn Tyr Tyr Lys Ala Trp Phe Tyr Ile Asn Ser Asp Gly Arg Tyr
65 70 75 80

Ser Gln Asn Glu Trp His Gly Asn Tyr Tyr Leu Lys Ser Gly Gly Tyr
85 90 95

Met Ala Gln Asn Glu Trp Ile Tyr Asp Ser Asn Tyr Lys Ser Trp Phe
100 105 110

Tyr Leu Lys Ser Asp Gly Ala Tyr Ala His Gln Glu Trp Gln Leu Ile
115 120 125

Gly Asn Lys Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Lys Ser
130 135 140

Gln Trp Gln Gly Ser Tyr Phe Leu Asn Gly Gln Gly Ala Met Met Gln
145 150 155 160

Asn Glu Trp Leu Tyr Asp Pro Ala Tyr Ser Ala Tyr Phe Tyr Leu Lys
165 170 175

Ser Asp Gly Thr Tyr Ala Asn Gln Glu Trp Gln Lys Val Gly Gly Lys
180 185 190

Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Arg Asn Glu Trp Gln
195 200 205

Gly Asn Tyr Tyr Leu Thr Gly Ser Gly Ala Met Ala Thr Asp Glu Val
210 215 220

Ile Met Asp Gly Thr Arg Tyr Ile Phe Ala Ala Ser Gly Glu Leu Lys
225 230 235 240

Glu Lys Lys Asp Leu Asn Val Gly Trp Val His Arg Asp Gly Lys Arg
245 250 255

Tyr Phe Phe Asn Asn Arg Glu Glu Gln Val Gly Thr Glu His Ala Lys
260 265 270

Lys Val Ile Asp Ile Ser Glu His Asn Gly Arg Ile Asn Asp Trp Lys
275 280 285

Lys Val Ile Asp Glu Asn Glu Val Asp Gly Val Ile Val Arg Leu Gly
290 295 300

Tyr Ser Gly Lys Glu Asp Lys Glu Leu Ala His Asn Ile Lys Glu Leu
305 310 315 320

Asn Arg Leu Gly Ile Pro Tyr Gly Val Tyr Leu Tyr Thr Tyr Ala Glu
325 330 335

Asn Glu Thr Asp Ala Glu Ser Asp Ala Lys Gln Thr Ile Glu Leu Ile
340 345 350

Lys Lys Tyr Asn Met Asn Leu Ser Tyr Pro Ile Tyr Tyr Asp Val Glu
355 360 365

Asn Trp Glu Tyr Val Asn Lys Ser Lys Arg Ala Pro Ser Asp Thr Gly
370 375 380

Thr Trp Val Lys Ile Ile Asn Lys Tyr Met Asp Thr Met Lys Gln Ala
385 390 395 400

Gly Tyr Gln Asn Val Tyr Val Tyr Ser Tyr Arg Ser Leu Leu Gln Thr
405 410 415

Arg Leu Lys His Pro Asp Ile Leu Lys His Val Asn Trp Val Ala Ala
420 425 430

Tyr Thr Asn Ala Leu Glu Trp Glu Asn Pro His Tyr Ser Gly Lys Lys
435 440 445

Gly Trp Gln Tyr Thr Ser Ser Glu Tyr Met Lys Gly Ile Gln Gly Arg
450 455 460

Val Asp Val Ser Val Trp Tyr
465 470

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

TACGTCTCAG CCTACTTTTG TAAGAGCAGA AGAATCTCCA CAAGTTGTCG AAAAATCTTC 60

ATTAGAGAAG AAATATGAGG AAGCAAAAGC AAAAGCTGAT ACTGCCAAGA AAGATTACGA 120

AACGGCTAAA AAGAAAGCAG AAGACGCTCA GAAAAAGTAT GAAGATGATC AGAAGAGAAC 180

TGAGGAGAAA GCTCGAAAAG AAGCAGAAGC ATCTCAAAAA TTGAATGATG TGGCGCTTGT 240

TGTTCAAAAT GCATATAAAG AGTACCGAGA AGTTCAAAAT CAACGTAGTA AATATAAATC 300

TGACGCTGAA TATCAGAAAA AATTAACAGA GGTCGACTCT AAAATAGAGA AGGCTAGGAA 360

AGAGCAACAG GACTTGCAAA ATAAATTTAA TGAAGTAAGA GCAGTTGTAG TTCCTGAACC 420

AAATGCGTTG GCTGAGACTA AGAAAAAAGC AGAAGAAGCT AAAGCAGAAG AAAAAGTAGC 480

TAAGAGAAAA TATGATTATG CAACTCTAAA GGTAGCACTA GCGAAGAAAG AAGTAGAGGC 540

TAAGGAACTT GAAATTGAAA AACTTCAATA TGAAATTTCT ACTTTGGAAC AAGAAGTTGC 600

TACTGCTCAA CATCAAGTAG ATAATTTGAA AAAACTTCTT GCTGGTGCGG ATCCTGATGA 660

TGGCACAGAA GTTATAGAAG CTAAATTAAA AAAAGGAGAA GCTGAGCTAA ACGCTAAACA 720

AGCTGAGTTA GCAAAAAAAC AAACAGAACT TGAAAAACTT CTTGACAGCC TTGATCCTGA 780

AGGTAAGACT CAGGATGAAT TAGATAAAGA AGCAGAAGAA GCTGAGTTGG ATAAAAAAGC 840

TGATGAACTT CAAAATAAAG TTGCTGATTT AGAAAAAGAA ATTAGTAACC TTGAAATATT 900

ACTTGGAGGG GCTGATNCTG AAGATGATAC TGCTGCTCTT CAAAATAAAT TAGCTACTAA 960

AAAAGCTGAA TTGGAAAAAA CTCAAAAAGA ATTAGATGCA GCTCTTAATG AGTTAGGCCC 1020

TGATGGAGAT GAAGAAGAAA CTCCAGCGCC GGCTCCTCAA CCAGAGCAAC CAGCTCCTGC 1080

ACCAAAACCA GAGCAACCAG CTCCAGCTCC AAAACCAGAG CAACCAGCTC CTGCACCAAA 1140

ACCAGAGCAA CCAGCTCCAG CTCCAAAACC AGAGCAACCA GCTCCAGCTC CAAAACCAGA 1200

GCAACCAGCT AAGCCGGAGA AACCAGCTGA AGAGCCTACT CAACCAGAAA AACCAGCCAC 1260

TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAT TTCTACAATA CTGATGGTTC 1320

AATGGCAATA GGTTGGCTCC AAAACAACGG TTCATGGTAC TACCTAAACG CTAACGGCGC 1380

TATGGCAACA GGTTGGGTGA AAGATGGAGA TACCTGGTAC TATCTTGAAG CATCAGGTGC 1440

TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA ACAGCAATGG 1500

CGCTATGGCG ACAGGCTGGC TCCAATACAA TGGCTCATGG TACTACCTCA ACGCTAATGG 1560

TGATATGGCG ACAGGATGGC TCCAATACAA CGGTTCATGG TATTACCTCA ACGCTAATGG 1620

TGATATGGCG ACAGGATGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1680

TGCTATGGCT ACAGGTTGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1740

TTCAATGGCA ACAGGTTGGG TGAAAGATGG AGATACCTGG TACTATCTTG AAGCATCAGG 1800

TGCTATGAAA GCAAGCCAAT GGTTCAAAGT ATCAGATAAA TGGTACTATG TCAATGGCTT 1860

AGGTGCCCTT GCAGTCAACA CAACTGTAGA TGGCTATAAA GTCAATGCCA ATGGTGAATG 1920

GGTT 1924

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

Thr Ser Gln Pro Thr Phe Val Arg Ala Glu Glu Ser Pro Gln Val Val
1 5 10 15

Glu Lys Ser Ser Leu Glu Lys Lys Tyr Glu Glu Ala Lys Ala Lys Ala
20 25 30

Asp Thr Ala Lys Lys Asp Tyr Glu Thr Ala Lys Lys Lys Ala Glu Asp
35 40 45

Ala Gln Lys Lys Tyr Glu Asp Asp Gln Lys Arg Thr Glu Glu Lys Ala
50 55 60

Arg Lys Glu Ala Glu Ala Ser Gln Lys Leu Asn Asp Val Ala Leu Val
65 70 75 80

Val Gln Asn Ala Tyr Lys Glu Tyr Arg Glu Val Gln Asn Gln Arg Ser
85 90 95

Lys Tyr Lys Ser Asp Ala Glu Tyr Gln Lys Lys Leu Thr Glu Val Asp
100 105 110

Ser Lys Ile Glu Lys Ala Arg Lys Glu Gln Gln Asp Leu Gln Asn Lys
115 120 125

Phe Asn Glu Val Arg Ala Val Val Val Pro Glu Pro Asn Ala Leu Ala
130 135 140

Glu Thr Lys Lys Lys Ala Glu Glu Ala Lys Ala Glu Glu Lys Val Ala
145 150 155 160

Lys Arg Lys Tyr Asp Tyr Ala Thr Leu Lys Val Ala Leu Ala Lys Lys
165 170 175

Glu Val Glu Ala Lys Glu Leu Glu Ile Glu Lys Leu Gln Tyr Glu Ile
180 185 190

Ser Thr Leu Glu Gln Glu Val Ala Thr Ala Gln His Gln Val Asp Asn
195 200 205

Leu Lys Lys Leu Leu Ala Gly Ala Asp Pro Asp Asp Gly Thr Glu Val
210 215 220

Ile Glu Ala Lys Leu Lys Lys Gly Glu Ala Glu Leu Asn Ala Lys Gln
225 230 235 240

Ala Glu Leu Ala Lys Lys Gln Thr Glu Leu Glu Lys Leu Leu Asp Ser
245 250 255

Leu Asp Pro Glu Gly Lys Thr Gln Asp Glu Leu Asp Lys Glu Ala Glu
260 265 270

Glu Ala Glu Leu Asp Lys Lys Ala Asp Glu Leu Gln Asn Lys Val Ala
275 280 285

Asp Leu Glu Lys Glu Ile Ser Asn Leu Glu Ile Leu Leu Gly Gly Ala
290 295 300

Asp Xaa Glu Asp Asp Thr Ala Ala Leu Gln Asn Lys Leu Ala Thr Lys
305 310 315 320

Lys Ala Glu Leu Glu Lys Thr Gln Lys Glu Leu Asp Ala Ala Leu Asn
325 330 335

Glu Leu Gly Pro Asp Gly Asp Glu Glu Glu Thr Pro Ala Pro Ala Pro
340 345 350

Gln Pro Glu Gln Pro Ala Pro Ala Pro Lys Pro Glu Gln Pro Ala Pro
355 360 365

Ala Pro Lys Pro Glu Gln Pro Ala Pro Ala Pro Lys Pro Glu Gln Pro
370 375 380

Ala Pro Ala Pro Lys Pro Glu Gln Pro Ala Pro Ala Pro Lys Pro Glu
385 390 395 400

Gln Pro Ala Lys Pro Glu Lys Pro Ala Glu Glu Pro Thr Gln Pro Glu
405 410 415

Lys Pro Ala Thr Pro Lys Thr Gly Trp Lys Gln Glu Asn Gly Met Trp
420 425 430

Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala Ile Gly Trp Leu Gln Asn
435 440 445

Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly
450 455 460

Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala
465 470 475 480

Met Lys Ala Ser Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val
485 490 495

Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Ser
500 505 510

Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gln
515 520 525

Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr
530 535 540

Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly
545 550 555 560

Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu
565 570 575

Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr
580 585 590

Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala Ser Gln Trp Phe
595 600 605

Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Leu Gly Ala Leu Ala
610 615 620

Val Asn Thr Thr Val Asp Gly Tyr Lys Val Asn Ala Asn Gly Glu Trp
625 630 635 640

Val

(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

TGGACAGGTG AAAGGTCATG CTACATTTGT GAAATCCATG ACAACTGAAA TGTACCAAGA 60

ACAACAGAAC CATTCTCTCG CCTACAATCA ACGCTTGGNT TCGCAAAATC GCATTGTAGA 120

TCCTTTTTTG GCGGAGGGAT ATGAGGTCAA TTACCAAGTG TCTGACGACC CTGATGCAGT 180

CTATGGTTAC TTGTCTATTC CAAGTTTGGA AATCATGGAG CCGGTTTATT TGGGAGCAGA 240

TTATCATCAT TTAGGGATGG GCTTGGCTCA TGTGGATGGT ACACCGCTGC CTCTGGATGG 300

TACAGGGATT CGCTCAGTGA TTGCTGGGCA CCGTGCAGAG CCAAGCCATG TCTTTTTCCG 360

CCATTTGGAT CAGCTAAAAG TTGGAGATGC TCTTTATTAT GATAATGGCC AGGAAATTGT 420

AGAATATCAG ATGATGGACA CAGAGATTAT TTTACCGTCG GAATGGGAAA AATTAGAATC 480

GGTTAGCTCT AAAAATATCA TGACCTTGAT AACCTGCGAT CCGATTCCTA CCTTTAATAA 540

ACGCTTATTA GTGAATTTTG AACGAGTCGC TGTTTATCAA AAATCAGATC CACAAACAGC 600

TGCAGTTGCG AGGGTTGCTT TTACGAAAGA AGGACAATCT GTATCGCGTG TTGCAACCTC 660

TCAATGGTTG 670

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

Gly Gln Val Lys Gly His Ala Thr Phe Val Lys Ser Met Thr Thr Glu
1 5 10 15

Met Tyr Gln Glu Gln Gln Asn His Ser Leu Ala Tyr Asn Gln Arg Leu
20 25 30

Xaa Ser Gln Asn Arg Ile Val Asp Pro Phe Leu Ala Glu Gly Tyr Glu
35 40 45

Val Asn Tyr Gln Val Ser Asp Asp Pro Asp Ala Val Tyr Gly Tyr Leu
50 55 60

Ser Ile Pro Ser Leu Glu Ile Met Glu Pro Val Tyr Leu Gly Ala Asp
65 70 75 80

Tyr His His Leu Gly Met Gly Leu Ala His Val Asp Gly Thr Pro Leu
85 90 95

Pro Leu Asp Gly Thr Gly Ile Arg Ser Val Ile Ala Gly His Arg Ala
100 105 110

Glu Pro Ser His Val Phe Phe Arg His Leu Asp Gln Leu Lys Val Gly
115 120 125

Asp Ala Leu Tyr Tyr Asp Asn Gly Gln Glu Ile Val Glu Tyr Gln Met
130 135 140

Met Asp Thr Glu Ile Ile Leu Pro Ser Glu Trp Glu Lys Leu Glu Ser
145 150 155 160

Val Ser Ser Lys Asn Ile Met Thr Leu Ile Thr Cys Asp Pro Ile Pro
165 170 175

Thr Phe Asn Lys Arg Leu Leu Val Asn Phe Glu Arg Val Ala Val Tyr
180 185 190

Gln Lys Ser Asp Pro Gln Thr Ala Ala Val Ala Arg Val Ala Phe Thr
195 200 205

Lys Glu Gly Gln Ser Val Ser Arg Val Ala Thr Ser Gln Trp Leu
210 215 220

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 784 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

GATTGCTCCT TTGAAGGATT TGAGAGAAAC CATGTTGGAA ATTGCTTCTG GTGCTCAAAA 60

TCTTCGTGCC AAGGAAGTTG GTGCCTATGA ACTGAGAGAA GTAACTCGCC AATTTAATGC 120

TATGTTGGAT CAGATTGATC AGTTGATGGT AGCTATTCGT AGCCAGGAAG AAACGACCCG 180

TCAGTACCAA CTTCAAGCCC TTTCGAGCCA GATTAATCCA CATTTCCTCT ATAACACTTT 240

GGACACCATC ATCTGGATGG CTGAATTTCA TGATAGTCAG CGAGTGGTGC AGGTGACCAA 300

GTCCTTGGCA ACCTATTTCC GCTTGGCGCT CAATCAAGGC AAGGACTTGA TTTGTCTCTC 360

TGACGAAATC AATCATGTCC GCCAGTATCT CTTTATCCAG AAACAACGCT ATGGAGATAA 420

GCTGGAATAC GAAATTAATG AAAATGTTGC CTTTGATAAT TTAGTCTTAC CCAAGCTGGT 480

CCTACAACCC CTTGTAGAAA ATGCTCTTTA CCATGGCATT AAGGAAAAGG AAGGTCAGGG 540

CCATATTAAA CTTTCTGTCC AGAAACAGGA TTCGGGATTG GTCATCCGTA TTGAGGATGA 600

TGGCGTTGGC TTCCAAGATG CTGGTGATAG TAGTCAAAGT CAACTCAAAC GTGGGGGAGT 660

TGGTCTTCAA AATGTCGATC AACGGCTCAA ACTTCATTTT GGAGCCAATT ACCATATGAA 720

GATTGATTCT AGACCCCAAA AAGGGACGAA AGTTGAAATA TATATAAATA GAATAGAAAC 780

TAGC 784

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

Ile Ala Pro Leu Lys Asp Leu Arg Glu Thr Met Leu Glu Ile Ala Ser
1 5 10 15

Gly Ala Gln Asn Leu Arg Ala Lys Glu Val Gly Ala Tyr Glu Leu Arg
20 25 30

Glu Val Thr Arg Gln Phe Asn Ala Met Leu Asp Gln Ile Asp Gln Leu
35 40 45

Met Val Ala Ile Arg Ser Gln Glu Glu Thr Thr Arg Gln Tyr Gln Leu
50 55 60

Gln Ala Leu Ser Ser Gln Ile Asn Pro His Phe Leu Tyr Asn Thr Leu
65 70 75 80

Asp Thr Ile Ile Trp Met Ala Glu Phe His Asp Ser Gln Arg Val Val
85 90 95

Gln Val Thr Lys Ser Leu Ala Thr Tyr Phe Arg Leu Ala Leu Asn Gln
100 105 110

Gly Lys Asp Leu Ile Cys Leu Ser Asp Glu Ile Asn His Val Arg Gln
115 120 125

Tyr Leu Phe Ile Gln Lys Gln Arg Tyr Gly Asp Lys Leu Glu Tyr Glu
130 135 140

Ile Asn Glu Asn Val Ala Phe Asp Asn Leu Val Leu Pro Lys Leu Val
145 150 155 160

Leu Gln Pro Leu Val Glu Asn Ala Leu Tyr His Gly Ile Lys Glu Lys
165 170 175

Glu Gly Gln Gly His Ile Lys Leu Ser Val Gln Lys Gln Asp Ser Gly
180 185 190

Leu Val Ile Arg Ile Glu Asp Asp Gly Val Gly Phe Gln Asp Ala Gly
195 200 205

Asp Ser Ser Gln Ser Gln Leu Lys Arg Gly Gly Val Gly Leu Gln Asn
210 215 220

Val Asp Gln Arg Leu Lys Leu His Phe Gly Ala Asn Tyr His Met Lys
225 230 235 240

Ile Asp Ser Arg Pro Gln Lys Gly Thr Lys Val Glu Ile Tyr Ile Asn
245 250 255

Arg Ile Glu Thr Ser
260

(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGGCTC CATAATATCT ATAAGGGATT 60

TACCCACTAC AAATATTATA GAGCCGAAAA TTCACATCTA ATATATGCAG ACTACTTTGA 120

AATGAAATTA AAAAAATTAT TAAAGGATGA CACAAAAGTT TTTGAAAAAT CTACATTCAA 180

ATTTGTAGAA GGATATAAAA TATACCTGAC AGAATCTAAA GAATCTGGAA TTAAACAAAT 240

GGACAATGTC ATAAAATATT TTGAGTTTAT TGAATCTAAA AGTATTGCTT TATATTTTCA 300

AAAACGATTA AATGAGCTGA TAGAT 325

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

Arg Ser Tyr Gly Thr Phe Phe Leu Gln Gln Asn Arg Leu His Asn Ile
1 5 10 15

Tyr Lys Gly Phe Thr His Tyr Lys Tyr Tyr Arg Ala Glu Asn Ser His
20 25 30

Leu Ile Tyr Ala Asp Tyr Phe Glu Met Lys Leu Lys Lys Leu Leu Lys
35 40 45

Asp Asp Thr Lys Val Phe Glu Lys Ser Thr Phe Lys Phe Val Glu Gly
50 55 60

Tyr Lys Ile Tyr Leu Thr Glu Ser Lys Glu Ser Gly Ile Lys Gln Met
65 70 75 80

Asp Asn Val Ile Lys Tyr Phe Glu Phe Ile Glu Ser Lys Ser Ile Ala
85 90 95

Leu Tyr Phe Gln Lys Arg Leu Asn Glu Leu Ile Asp
100 105

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

CAACGTTGAG AATTATTTGC GAATGTGTTT GGATAGCATT CAGAATCAGA CGTATCAAAA 60

TTTTGAGTGT TTATTAATCA ATGATGGCTC TCCAGATCAT TCATCCAAAA TATGTGAAGA 120

ATTTGTAGAG AAAGATTCTC GTTTCAAATA TTTTGAGAAA GCAAACGGCG GTCTTTCATC 180

AGCTCGTAAC CTAGGTATTG AATGTTCGGG GGGGGGCGTA CATTACTTTT GTAGACTC 238
(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

Asn Val Glu Asn Tyr Leu Arg Met Cys Leu Asp Ser Ile Gln Asn Gln
1 5 10 15

Thr Tyr Gln Asn Phe Glu Cys Leu Leu Ile Asn Asp Gly Ser Pro Asp
20 25 30

His Ser Ser Lys Ile Cys Glu Glu Phe Val Glu Lys Asp Ser Arg Phe
35 40 45

Lys Tyr Phe Glu Lys Ala Asn Gly Gly Leu Ser Ser Ala Arg Asn Leu
50 55 60

Gly Ile Glu Cys Ser Gly Gly Gly Val His Tyr Phe Cys Arg Leu
65 70 75

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 742 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

CTACTATCAA TCAAGTTCTT CAGCCATTGA GGCCACCATT GAGGGCAACA GCCAAACGAC 60

CATCAGCCAG ACTAGCCACT TTATTCAGTC TTATATCAAA AAACTAGAAA CCACCTCGAC 120

TGGTTTGACC CAGCAGACGG ATGTTCTGGC CTATGCTGAG AATCCCAGTC AAGACAAGGT 180

CGAGGGAATC CGAGATTTGT TTTTGACCAT CTTGAAGTCA GATAAGGACT TGAAAACTGT 240

TGTGCTGGTG ACCAAATCTG GTCAGGTCAT TTCTACAGAT GACAGTGTGC AGATGAAAAC 300

TTCCTCTGAT ATGATGGCTG AGGATTGGTA CCAAAAGGCC ATTCATCAGG GAGCTATGCC 360

TGTTTTGACT CCAGCTCGTA AATCAGATAG TCAGTGGGTC ATTTCTGTCA CTCAAGAACT 420

TGTTGATGCA AAGGGAGCCA ATCTTGGTGT GCTTCGTTTG GATATTTCTT ATGAAACTCT 480

GGAAGCCTAT CTCAATCAAC TCCAGTTGGG GCAGCAGGGC TTTGCCTTCA TTATCAATGA 540

AAACCATGAA TTTGTCTACC ATCCTCAACA CACAGTTTAT AGTTCGTCTA GCAAAATGGA 600

GGCTATGAAA CCCTACATCG ATACAGGTCA GGGTTATACT CCTGGTCACA AATCCTACGT 660

CAGTCAAGAG AAGATTGCAG GAACTGATTG GACGGTGCTT GGCGTGTCAT CATTGGAAAA 720

GTTAGACCAG GTTCGGAGTC AG 742

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 247 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

Tyr Tyr Gln Ser Ser Ser Ser Ala Ile Glu Ala Thr Ile Glu Gly Asn
1 5 10 15

Ser Gln Thr Thr Ile Ser Gln Thr Ser His Phe Ile Gln Ser Tyr Ile
20 25 30

Lys Lys Leu Glu Thr Thr Ser Thr Gly Leu Thr Gln Gln Thr Asp Val
35 40 45

Leu Ala Tyr Ala Glu Asn Pro Ser Gln Asp Lys Val Glu Gly Ile Arg
50 55 60

Asp Leu Phe Leu Thr Ile Leu Lys Ser Asp Lys Asp Leu Lys Thr Val
65 70 75 80

Val Leu Val Thr Lys Ser Gly Gln Val Ile Ser Thr Asp Asp Ser Val
85 90 95

Gln Met Lys Thr Ser Ser Asp Met Met Ala Glu Asp Trp Tyr Gln Lys
100 105 110

Ala Ile His Gln Gly Ala Met Pro Val Leu Thr Pro Ala Arg Lys Ser
115 120 125

Asp Ser Gln Trp Val Ile Ser Val Thr Gln Glu Leu Val Asp Ala Lys
130 135 140

Gly Ala Asn Leu Gly Val Leu Arg Leu Asp Ile Ser Tyr Glu Thr Leu
145 150 155 160

Glu Ala Tyr Leu Asn Gln Leu Gln Leu Gly Gln Gln Gly Phe Ala Phe
165 170 175

Ile Ile Asn Glu Asn His Glu Phe Val Tyr His Pro Gln His Thr Val
180 185 190

Tyr Ser Ser Ser Ser Lys Met Glu Ala Met Lys Pro Tyr Ile Asp Thr
195 200 205

Gly Gln Gly Tyr Thr Pro Gly His Lys Ser Tyr Val Ser Gln Glu Lys
210 215 220

Ile Ala Gly Thr Asp Trp Thr Val Leu Gly Val Ser Ser Leu Glu Lys
225 230 235 240

Leu Asp Gln Val Arg Ser Gln
245

(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1282 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

GACAAAAACA TTAAAACGTC CTGAGGTTTT ATCACCTGCA GGGACTTTAG AGAAGCTAAA 60

GGTAGCTGTT CAGTATGGAG CAGATGCTGT CTTTATCGGT GGTCAGGCCT ATGGTCTTCG 120

TAGCCGTGCG GGAAACTTTA CTTTCGAACA GATGGAAGAA GGCGTGCAGT TTGCGGCCAA 180

GTATGGTGCC AAGGTCTATG TAGCGGCTAA TATGGTTATG CACGAAGGAA ATGAAGCTGG 240

TGCTGGTGAG TGGTTCCGTA AACTGCGTGA TATCGGGATT GCAGCAGTTA TCGTATCTGA 300

CCCAGCCTTG ATTATGATTG CAGTGACTGA AGCACCAGGC CTTGAAATCC ACCTTTCTAC 360

CCAAGCCAGT GCCACTAACT ATGAAACCCT TGAGTTCTGG AAAGAGCTAG GCTTGACTCG 420

TGTCGTTTTA GCGCGTGAGG TTTCAATGGA AGAATTAGCT GAGATCCGCA AACGTACAGA 480

TGTTGAAATT GAAGCCTTTG TCCATGGAGC TATGTGTATT TCATACTCTG GACGTTGTAC 540

TCTTTCAAAC CACATGAGTA TGCGTGATGC CAACCGTGGT GGATGTTCTC AGTCATGCCG 600

TTGGAAATAC GACCTTTACG ATATGCCATT TGGGAAAGAA CGTAAGAGTT TGCAGGGTGA 660

GATTCCAGAA GAATTTTCAA TGTCAGCCGT TGACATGTCT ATGATTGACC ANATTCCAGA 720

TATGATTGAA AATGGTGTGG ACAGTCTAAA AATCGAAGGA CGTATGNAGT CTATTCACTA 780

NGTATCAACA GTAACCAACT GCTACAAGGC GGCTGTGGAT GCCTATCTTG AAAGTCCTGA 840

AAAGTTTGAA GCTATCAAAC AAGACTTGGT GGACGAGATG TGGAAGGTTG CCCAACGTGA 900

ACTGGCTACA GGATTTTACT ATGGTACACC ATCTGAAAAT GAGCAGTTGT TTGGTGCTCG 960

TCGTAAAATC CCTGAGTACA AGTTTGTCGC TGAAGTGGTT TCTTATGATG ATGCGGCACA 1020

AACAGCAACT ATTCGTCAAC GAAACGTCAT TAACGAAGGG GACCAAGTTG AGTTTTATGG 1080

TCCAGGTTTC CGTCATTTTG AAACCTATAT TGAAGATTTG CATGATGCTA AAGGCAATAA 1140

AATCGACCGC GCTCCAAATC CAATGGAACT ATTGACTATT AAAGTCCCAC AACCTGTTCA 1200

ATCAGGAGAC ATGGTTCGAG CTCTTAAAGA GGGGCTTATC AATCTTTATA AGGAAGATGG 1260

AACCAGCGTC ACAGTTCGTG CT 1282

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

Thr Lys Thr Leu Lys Arg Pro Glu Val Leu Ser Pro Ala Gly Thr Leu
1 5 10 15

Glu Lys Leu Lys Val Ala Val Gln Tyr Gly Ala Asp Ala Val Phe Ile
20 25 30

Gly Gly Gln Ala Tyr Gly Leu Arg Ser Arg Ala Gly Asn Phe Thr Phe
35 40 45

Glu Gln Met Glu Glu Gly Val Gln Phe Ala Ala Lys Tyr Gly Ala Lys
50 55 60

Val Tyr Val Ala Ala Asn Met Val Met His Glu Gly Asn Glu Ala Gly
65 70 75 80

Ala Gly Glu Trp Phe Arg Lys Leu Arg Asp Ile Gly Ile Ala Ala Val
85 90 95

Ile Val Ser Asp Pro Ala Leu Ile Met Ile Ala Val Thr Glu Ala Pro
100 105 110

Gly Leu Glu Ile His Leu Ser Thr Gln Ala Ser Ala Thr Asn Tyr Glu
115 120 125

Thr Leu Glu Phe Trp Lys Glu Leu Gly Leu Thr Arg Val Val Leu Ala
130 135 140

Arg Glu Val Ser Met Glu Glu Leu Ala Glu Ile Arg Lys Arg Thr Asp
145 150 155 160

Val Glu Ile Glu Ala Phe Val His Gly Ala Met Cys Ile Ser Tyr Ser
165 170 175

Gly Arg Cys Thr Leu Ser Asn His Met Ser Met Arg Asp Ala Asn Arg
180 185 190

Gly Gly Cys Ser Gln Ser Cys Arg Trp Lys Tyr Asp Leu Tyr Asp Met
195 200 205

Pro Phe Gly Lys Glu Arg Lys Ser Leu Gln Gly Glu Ile Pro Glu Glu
210 215 220

Phe Ser Met Ser Ala Val Asp Met Ser Met Ile Asp Xaa Ile Pro Asp
225 230 235 240

Met Ile Glu Asn Gly Val Asp Ser Leu Lys Ile Glu Gly Arg Met Xaa
245 250 255

Ser Ile His Xaa Val Ser Thr Val Thr Asn Cys Tyr Lys Ala Ala Val
260 265 270

Asp Ala Tyr Leu Glu Ser Pro Glu Lys Phe Glu Ala Ile Lys Gln Asp
275 280 285

Leu Val Asp Glu Met Trp Lys Val Ala Gln Arg Glu Leu Ala Thr Gly
290 295 300

Phe Tyr Tyr Gly Thr Pro Ser Glu Asn Glu Gln Leu Phe Gly Ala Arg
305 310 315 320

Arg Lys Ile Pro Glu Tyr Lys Phe Val Ala Glu Val Val Ser Tyr Asp
325 330 335

Asp Ala Ala Gln Thr Ala Thr Ile Arg Gln Arg Asn Val Ile Asn Glu
340 345 350

Gly Asp Gln Val Glu Phe Tyr Gly Pro Gly Phe Arg His Phe Glu Thr
355 360 365

Tyr Ile Glu Asp Leu His Asp Ala Lys Gly Asn Lys Ile Asp Arg Ala
370 375 380

Pro Asn Pro Met Glu Leu Leu Thr Ile Lys Val Pro Gln Pro Val Gln
385 390 395 400

Ser Gly Asp Met Val Arg Ala Leu Lys Glu Gly Leu Ile Asn Leu Tyr
405 410 415

Lys Glu Asp Gly Thr Ser Val Thr Val Arg Ala
420 425

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 778 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

TTCTCAGGAG ACCTTTAAAA ATATCACCAA TAGCTTCTCC ATGCAAATCA ATCGTCGCGT 60

CAACCAAGGA ACGCCTCGTG GTGCTGGGAA TATCAAGGGT GAAGACATCA AAAAAATCAC 120

CGAAAACAAG GCCATTGAGT CTTATGTCAA ACGTATCAAC GCTATCGGAG ATTTGACTGG 180

ATATGACCTG ATTGAAACGC CAGAAACCAA GAAGAATCTC ACTGCTGATC GTGCCAAGCG 240

TTTTGGAAGT AGCTTGATGA TTACAGGTGT CAATGACTCC TCTAAAGAAG ACAAGTTTGT 300

CTCTGGTTCT TATAAACTAG TCGAAGGAGA GCACTTAACC AACGACGACA AGGATAAAAT 360

CCTCTTGCAC AAGGACTTGG CAGCCAAACA CGGCTGGAAA GTAGGGGACA AGGTTAAACT 420

GGACTCTAAT ATCTACGATG CAGATAATGA AAAAGGAGCC AAGGAAACAG TTGAAGTGAC 480

AATCAAGGGA CTCTTTGATG GTCATAATAA GTCAGCAGTA ACCTACTCAC AAGAACTTTA 540

CGAAAACACA GCTATTACAG ACATTCACAC TGCTGCAAAA CTTTATGGAT ACACAGAAGA 600

CACAGCCATT TATGGGGACG CAACCTTCTT TGTAACAGCA GACAAGAACT TGGATGATGT 660

TATGAAAGAG TTGAATGGCA TCAGTGGTAT CAACTGGAAG AGCTACACAC TCGTCAAGAG 720

CTCCTCTAAC TACCCAGCTC TTGAGCAATC TATCTCTGGT ATGTACAAGA TGGCCAAC 778

(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 259 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:

Ser Gln Glu Thr Phe Lys Asn Ile Thr Asn Ser Phe Ser Met Gln Ile
1 5 10 15

Asn Arg Arg Val Asn Gln Gly Thr Pro Arg Gly Ala Gly Asn Ile Lys
20 25 30

Gly Glu Asp Ile Lys Lys Ile Thr Glu Asn Lys Ala Ile Glu Ser Tyr
35 40 45

Val Lys Arg Ile Asn Ala Ile Gly Asp Leu Thr Gly Tyr Asp Leu Ile
50 55 60

Glu Thr Pro Glu Thr Lys Lys Asn Leu Thr Ala Asp Arg Ala Lys Arg
65 70 75 80

Phe Gly Ser Ser Leu Met Ile Thr Gly Val Asn Asp Ser Ser Lys Glu
85 90 95

Asp Lys Phe Val Ser Gly Ser Tyr Lys Leu Val Glu Gly Glu His Leu
100 105 110

Thr Asn Asp Asp Lys Asp Lys Ile Leu Leu His Lys Asp Leu Ala Ala
115 120 125

Lys His Gly Trp Lys Val Gly Asp Lys Val Lys Leu Asp Ser Asn Ile
130 135 140

Tyr Asp Ala Asp Asn Glu Lys Gly Ala Lys Glu Thr Val Glu Val Thr
145 150 155 160

Ile Lys Gly Leu Phe Asp Gly His Asn Lys Ser Ala Val Thr Tyr Ser
165 170 175

Gln Glu Leu Tyr Glu Asn Thr Ala Ile Thr Asp Ile His Thr Ala Ala
180 185 190

Lys Leu Tyr Gly Tyr Thr Glu Asp Thr Ala Ile Tyr Gly Asp Ala Thr
195 200 205

Phe Phe Val Thr Ala Asp Lys Asn Leu Asp Asp Val Met Lys Glu Leu
210 215 220

Asn Gly Ile Ser Gly Ile Asn Trp Lys Ser Tyr Thr Leu Val Lys Ser
225 230 235 240

Ser Ser Asn Tyr Pro Ala Leu Glu Gln Ser Ile Ser Gly Met Tyr Lys
245 250 255

Met Ala Asn



(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

AGTAAATGCG CAATCAAATT CATTAATATT AATAGATGAA CCTGAAATCT CACTTCATCC 60

GAGTGCAATC TATAAATTTA AAGAGTTTTT ACTTCAAGAG TGTTTAAATA AAAAACATCA 120

AATTATTATC ACTACACATT CTACACAACT TATAAAAGAT TTTCCTAGAG AAGCCGTGAA 180

ACTTTTAGTG AAAAACGGAG AAAAGGTAGA TGTTATTGAA AATATTGATT ATCAGGATGC 240

ATTTTTTGAA TTAGGTGATG TGTATCATTC TAGGAAGATG ATTTATGTTG AAGATAGACT 300

AGCTAAATAT ATTCTAGAGT TTGTTATCAC TCATTCAGGT AGTGAGAATC TTAAACAGAA 360

TTTAGTAGTG AGATATATTC CTGGTGGAGC AAATCAAATA ATTTGTAATA ATATTTTAAA 420

CTCATCGTAT TTAGATTCCG ATAACCATTA TTTTTGGCTT GATGGAGATC AAAACACTAA 480

TGTTAGTGAA TCAAATAATT TAATGAACTA TCTTGAAAAT GGTGTTGTTA TATCAGATAA 540

AATTCCTGAA TCAGATAATA AAAATCTTGA TGATATTATA AAATTGATAA NGGGATGTCC 600

AATTAAATTT AATGTTTCAG GTAATAAAGG GCAAAAAAAT AATATTGAAT TAATTGCGAA 660

ACAAAGAAGC TTTATAGATT ATTGGGCTAA ATAC 694

(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

Val Asn Ala Gln Ser Asn Ser Leu Ile Leu Ile Asp Glu Pro Glu Ile
1 5 10 15

Ser Leu His Pro Ser Ala Ile Tyr Lys Phe Lys Glu Phe Leu Leu Gln
20 25 30

Glu Cys Leu Asn Lys Lys His Gln Ile Ile Ile Thr Thr His Ser Thr
35 40 45

Gln Leu Ile Lys Asp Phe Pro Arg Glu Ala Val Lys Leu Leu Val Lys
50 55 60

Asn Gly Glu Lys Val Asp Val Ile Glu Asn Ile Asp Tyr Gln Asp Ala
65 70 75 80

Phe Phe Glu Leu Gly Asp Val Tyr His Ser Arg Lys Met Ile Tyr Val
85 90 95

Glu Asp Arg Leu Ala Lys Tyr Ile Leu Glu Phe Val Ile Thr His Ser
100 105 110

Gly Ser Glu Asn Leu Lys Gln Asn Leu Val Val Arg Tyr Ile Pro Gly
115 120 125

Gly Ala Asn Gln Ile Ile Cys Asn Asn Ile Leu Asn Ser Ser Tyr Leu
130 135 140

Asp Ser Asp Asn His Tyr Phe Trp Leu Asp Gly Asp Gln Asn Thr Asn
145 150 155 160

Val Ser Glu Ser Asn Asn Leu Met Asn Tyr Leu Glu Asn Gly Val Val
165 170 175

Ile Ser Asp Lys Ile Pro Glu Ser Asp Asn Lys Asn Leu Asp Asp Ile
180 185 190

Ile Lys Leu Ile Xaa Gly Cys Pro Ile Lys Phe Asn Val Ser Gly Asn
195 200 205

Lys Gly Gln Lys Asn Asn Ile Glu Leu Ile Ala Lys Gln Arg Ser Phe
210 215 220

Ile Asp Tyr Trp Ala Lys Tyr
225 230

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

TTACCGCGTT CATCAAGATG TCAAACAAGT CATGACCTAT CAACCCATGG TGCGAGAAAT 60

ATTGAGTGAA CAAGACACCC CAGCAAACGA AGAGCTTGTG CTTGCTATGA TTTATACTGA 120

AACAAAAGGA AAAGAAGGCG ATGTTATGCA GTCTAGTGAG TCTGCAAGTG GTTCCACCAA 180

CACCATCAAT GATAATGCCT CTAGCATTCG GCAAGGCATT CAAACTCTGA CAGGCAATCT 240

CTATCTGGCG CAGAAGAAGG GGGTAGATAT CTGGACAGCT GTTCAAGCCT ATAATTTTGG 300

ACCTGCCTAT ATCGATTTTA TCGCCCAAAA TGGCAAGGAA AATACCCTGG CTCTAGCCAA 360

ACAGTACTCT CGTGAGACTG TTGCCCCCTT GCTTGGTAAT AGGACTGGAA AGACTTATAG 420

TTATATTCAC CCCATTTCCA TTTTTCACGG TGCTGAACTC TATGTAAATG GAGGAAACTA 480

TTATTATTCT AGACAGGTAC GACTTAACCT TTACATCATC AAATGTTTCA CTCTCTTTTC 540

AACATCTGGC 550

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

Tyr Arg Val His Gln Asp Val Lys Gln Val Met Thr Tyr Gln Pro Met
1 5 10 15

Val Arg Glu Ile Leu Ser Glu Gln Asp Thr Pro Ala Asn Glu Glu Leu
20 25 30

Val Leu Ala Met Ile Tyr Thr Glu Thr Lys Gly Lys Glu Gly Asp Val
35 40 45

Met Gln Ser Ser Glu Ser Ala Ser Gly Ser Thr Asn Thr Ile Asn Asp
50 55 60

Asn Ala Ser Ser Ile Arg Gln Gly Ile Gln Thr Leu Thr Gly Asn Leu
65 70 75 80

Tyr Leu Ala Gln Lys Lys Gly Val Asp Ile Trp Thr Ala Val Gln Ala
85 90 95

Tyr Asn Phe Gly Pro Ala Tyr Ile Asp Phe Ile Ala Gln Asn Gly Lys
100 105 110

Glu Asn Thr Leu Ala Leu Ala Lys Gln Tyr Ser Arg Glu Thr Val Ala
115 120 125

Pro Leu Leu Gly Asn Arg Thr Gly Lys Thr Tyr Ser Tyr Ile His Pro
130 135 140

Ile Ser Ile Phe His Gly Ala Glu Leu Tyr Val Asn Gly Gly Asn Tyr
145 150 155 160

Tyr Tyr Ser Arg Gln Val Arg Leu Asn Leu Tyr Ile Ile Lys Cys Phe
165 170 175

Thr Leu Phe Ser Thr Ser Gly
180

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

GTGGATGGGC TTTAACTATC TTCGTATTCG CCGTGCGGCT AAAATTGTGG ACAATGAGGA 60

GTTTGAAGCC TTGATTCGTA CGGGTCAATT GATTGATTTG CGCGACCCAG CAGAATTCCA 120

CAGAAAACAT ATCCTTGGTG CACGCAATAT TCCTTCAAGT CAGTTGAAAA CTAGTCTTGC 180

AGCCCTTCGT AAAGATAAAC CTGTCCTTCT CTACGAAAAC CAACGTGCGC AACGAGTTAC 240

AAATGCAGCT CTTTACTTGA AAAAACAAGG TTTTTCTGAG ATTTATATCC TTTCTTATGG 300

CTTGGATTCT TGGAAAGGGA AAGTGAAGAC TAGC 334 

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

Trp Met Gly Phe Asn Tyr Leu Arg Ile Arg Arg Ala Ala Lys Ile Val
1 5 10 15

Asp Asn Glu Glu Phe Glu Ala Leu Ile Arg Thr Gly Gln Leu Ile Asp
20 25 30

Leu Arg Asp Pro Ala Glu Phe His Arg Lys His Ile Leu Gly Ala Arg
35 40 45

Asn Ile Pro Ser Ser Gln Leu Lys Thr Ser Leu Ala Ala Leu Arg Lys
50 55 60

Asp Lys Pro Val Leu Leu Tyr Glu Asn Gln Arg Ala Gln Arg Val Thr
65 70 75 80

Asn Ala Ala Leu Tyr Leu Lys Lys Gln Gly Phe Ser Glu Ile Tyr Ile
85 90 95

Leu Ser Tyr Gly Leu Asp Ser Trp Lys Gly Lys Val Lys Thr Ser
100 105 110

(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

ACTAAACCAG CATCGTTCGC AGGAAAATAA GGACAATAAT CGTGTCTCTT ATGTGGATGG 60

CAGCCAGTCA AGTCAGAAAA GTGAAAACTT GACACCAGAC CAGGTTAGCC AGAAAGAAGG 120

AATTCAGGCT GAGCAAATTG TAATCAAAAT TACAGATCAG GGCTATGTAA CGTCACACGG 180

TGACCACTAT CATTACTATA ATGGGAAAGT TCCTTATGAT GCCCTCTTTA GTGAAGAACT 240

CTTGATGAAG GATCCAAACT ATCAACTTAA AGACGCTGAT ATTGTCAATG AAGTCAAGGG 300

TGGTTATATC ATCAAGGTCG ATGGAAAATA TTATGTCTAC CTGAAAGATG CAGCTCATGC 360

TGATAATGTT CGAACTAAAG ATGAAATCAA TCGTCAAAAA CAAGAACATG TCAAAGATAA 420

TGAGAAGGTT AACTCTAATG TTGCTGTAGC AAGGTCTCAG GGACGATATA CGACAAATGA 480

TGGTTATGTC TTTAATCCAG CTGATATTAT CGAAGATACG GGTAATGCTT ATATCGTTCC 540

TCATGGAGGT CACTATCACT ACATTCCCAA AAGCGATTTA TCTGCTAGTG AATTAGCAGC 600

AGCTAAAGCA CATCTGGCTG GAAAAAATAT GCAACCGAGT CAGTTAAGCT ATTCTTCAAC 660

AGCTAGTGAC AATAACACGC AATCTGTAGC AAAAGGATCA ACTAGCAAGC CAGCAAATAA 720

ATCTGAAAAT CTCCAGAGTC TTTTGAAGGA ACTCTATGAT TCACCTAGCG CCCAACGTTA 780

CAGTGAATCA GATGGCCTGG TCTTTGACCC TGCTAAGATT ATCAGTCGTA CACCAAATGG 840

AGTTGCGATT CCGCATGGCG ACCATTACCA CTTTATTCCT TACAGCAAGC TTTCTGCCTT 900

AGAAGAAAAG ATTGCCAGAA TGGTGCCTAT CAGTGGAACT GGTTCTACAG TTTCTACAAA 960

TGCAAAACCT AATGAAGTAG TGTCTAGTCT AGGCAGTCTT TCAAGCAATC CTTCTTCTTT 1020

AACGACAAGT AAGGAGCTCT CTTCAGCATC TGATGGTTAT ATTTTTAATC CAAAAGATAT 1080

CGTTGAAGAA ACGGCTACAG CTTATATTGT AAGACATGGT GATCATTTCC ATTACATTCC 1140

AAAATCAAAT CAAATTGGGC AACCGACTCT TCCAAACAAT AGTCTAGCAA CACCTTCTCC 1200

ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT GAAGAAGATG GATACGGATT 1260

TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT GTCATGAGTC ACGGAGACCA 1320

CAATCATTAT TTCTTCAAGA AG 1342
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn Asn Arg Val Ser
1 5 10 15

Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu Asn Leu Thr Pro
20 25 30

Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu Gln Ile Val Ile
35 40 45

Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly Asp His Tyr His
50 55 60

Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe Ser Glu Glu Leu
65 70 75 80

Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala Asp Ile Val Asn
85 90 95

Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly Lys Tyr Tyr Val
100 105 110

Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg Thr Lys Asp Glu
115 120 125

Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn Glu Lys Val Asn
130 135 140

Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr Thr Thr Asn Asp
145 150 155 160

Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp Thr Gly Asn Ala
165 170 175

Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile Pro Lys Ser Asp
180 185 190

Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His Leu Ala Gly Lys
195 200 205

Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr Ala Ser Asp Asn
210 215 220

Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys Pro Ala Asn Lys
225 230 235 240

Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr Asp Ser Pro Ser
245 250 255

Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Lys
260 265 270

Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro His Gly Asp His
275 280 285

Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu Glu Glu Lys Ile
290 295 300

Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr Val Ser Thr Asn
305 310 315 320

Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser Leu Ser Ser Asn
325 330 335

Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser Ala Ser Asp Gly
340 345 350

Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr Ala Thr Ala Tyr
355 360 365

Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro Lys Ser Asn Gln
370 375 380

Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala Thr Pro Ser Pro
385 390 395 400

Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys His Glu Glu Asp
405 410 415

Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu Asp Glu Ser Gly
420 425 430

Phe Val Met Ser His Gly Asp His Asn His Tyr Phe Phe Lys Lys
435 440 445

(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 934 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

TGACTACCTT GAAATCCCAC TTTACAGCTA TCTTGGTGGA TTCAACACTA AAGTTCTTCC 60

AACTCCAATG ATGAACATCA TCAACGGTGG TTCTCACTCT GACGCTCCAA TCGCTTTCCA 120

AGAGTTCATG ATCTTGCCAG TTGGTGCGCC AACATTTAAA GAAGCCCTTC GTTACGGTGC 180

TGAAATCTTC CACGCTCTTA AGAAAATCCT TAAATCACGT GGTTTGGAAA CTGCCGTAGG 240

TGACGAAGGT GGATTCGCTC CTCGTTTCGA AGGAACTGAA GATGGTGTTG AAACTATCCT 300

TGCTGCGATT GAAGCTGCTG GATATGTACC AGGTAAAGAC GTATTTATCG GATTTGACTG 360

TGCTTCATCA GAATTCTACG ATAAAGAACG TAAAGTTTAC GACTACACTA AATTTGAAGG 420

TGAAGGTGCT GCTGTTCGTA CATCTGCAGA ACAAATCGAC TACCTTGAAG AATTGGTTAA 480

CAAATACCCA ATCATCACTA TTGAAGATGG TATGGATGAA AACGACTGGG ATGGTTGGAA 540

AGCTCTTACT GAACGTCTTG GTAAGAAAGT ACAACTTGTT GGTGACGACT TCTTCGTAAC 600

AAACACTGAC TACCTTGCAC GTGGTATCCA AGAAGGTGCT GCTAACTCAA TCCTTATCAA 660

AGTTAACCAA ATCGGTACTC TTACTGAAAC TTTTGAAGCT ATCGAAATGG CTAAAGAAGC 720

TGGTTACACT GCTGTTGTAT CACACCGTTC AGGTGAAACT GAAGATTCAA CAATCGCTGA 780

TATTGCAGTT GCAACTAACG CAGGACAAAT CAAGACTGGT TCACTTTCAC GTACAGACCG 840

CATCGCTAAA TACAACCAAT TGCTTCGTAT CGAAGACCAA CTTGGTGAAG TAGCTGAATA 900

TCGTGGATTG AAATCATTCT ACAACCTTAA AAAA 934
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

Asp Tyr Leu Glu Ile Pro Leu Tyr Ser Tyr Leu Gly Gly Phe Asn Thr
1 5 10 15

Lys Val Leu Pro Thr Pro Met Met Asn Ile Ile Asn Gly Gly Ser His
20 25 30

Ser Asp Ala Pro Ile Ala Phe Gln Glu Phe Met Ile Leu Pro Val Gly
35 40 45

Ala Pro Thr Phe Lys Glu Ala Leu Arg Tyr Gly Ala Glu Ile Phe His
50 55 60

Ala Leu Lys Lys Ile Leu Lys Ser Arg Gly Leu Glu Thr Ala Val Gly
65 70 75 80

Asp Glu Gly Gly Phe Ala Pro Arg Phe Glu Gly Thr Glu Asp Gly Val
85 90 95

Glu Thr Ile Leu Ala Ala Ile Glu Ala Ala Gly Tyr Val Pro Gly Lys
100 105 110

Asp Val Phe Ile Gly Phe Asp Cys Ala Ser Ser Glu Phe Tyr Asp Lys
115 120 125

Glu Arg Lys Val Tyr Asp Tyr Thr Lys Phe Glu Gly Glu Gly Ala Ala
130 135 140

Val Arg Thr Ser Ala Glu Gln Ile Asp Tyr Leu Glu Glu Leu Val Asn
145 150 155 160

Lys Tyr Pro Ile Ile Thr Ile Glu Asp Gly Met Asp Glu Asn Asp Trp
165 170 175

Asp Gly Trp Lys Ala Leu Thr Glu Arg Leu Gly Lys Lys Val Gln Leu
180 185 190

Val Gly Asp Asp Phe Phe Val Thr Asn Thr Asp Tyr Leu Ala Arg Gly
195 200 205

Ile Gln Glu Gly Ala Ala Asn Ser Ile Leu Ile Lys Val Asn Gln Ile
210 215 220

Gly Thr Leu Thr Glu Thr Phe Glu Ala Ile Glu Met Ala Lys Glu Ala
225 230 235 240

Gly Tyr Thr Ala Val Val Ser His Arg Ser Gly Glu Thr Glu Asp Ser
245 250 255

Thr Ile Ala Asp Ile Ala Val Ala Thr Asn Ala Gly Gln Ile Lys Thr
260 265 270

Gly Ser Leu Ser Arg Thr Asp Arg Ile Ala Lys Tyr Asn Gln Leu Leu
275 280 285

Arg Ile Glu Asp Gln Leu Gly Glu Val Ala Glu Tyr Arg Gly Leu Lys
290 295 300

Ser Phe Tyr Asn Leu Lys Lys
305 310

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

TCGTATCTTT TTTTGGAGCA ATGTTCGCGT AGAAGGACAT TCCATGGATC CGACCCTAGC 60

GGATGGCGAA ATTCTCTTCG TTGTAAAACA CCTTCCTATT GACCGTTTTG ATATCGTGGT 120

GGCCCATGAG GAAGATGGCA ATAAGGACAT CGTCAAGCGC GTGATTGGAA TGCCTGGCGA 180

CACCATTCGT TACGAAAATG ATAAACTCTA CATCAATGAC AAAGAAACGG ACGAGCCTTA 240

TCTAGCAGAC TATATCAAAC GCTTCAAGGA TGACAAACTC CAAAGCACTT ACTCAGGCAA 300

GGGCTTTGAA GGAAATAAAG GAACTTTCTT TAGAAGTATC GCTCAAAAAG CTCAAGCCTT 360

CACAGTTGAT GTCAACTACA ACACCAACTT TAGCTTTACT GTTCCAGAAG GAGAATACCT 420

TCTCCTCGGA GATGACCGCT TGGTTTCGAG CGACAGCCGC CACGTAGGTA CCTTCAAAGC 480

AAAAGATATC ACAGGGGAAG CTAAATTCCG CTTATGGCCA ATCACCCGTA TCGGAACATT 540

T

(2) INFORMATION FOR SEQ ID NO: 186:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 180 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Arg Ile Phe Phe Trp Ser Asn Val Arg Val Glu Gly His Ser Met Asp
1 5 10 15

Pro Thr Leu Ala Asp Gly Glu Ile Leu Phe Val Val Lys His Leu Pro
20 25 30

Ile Asp Arg Phe Asp Ile Val Val Ala His Glu Glu Asp Gly Asn Lys
35 40 45

Asp Ile Val Lys Arg Val Ile Gly Met Pro Gly Asp Thr Ile Arg Tyr
50 55 60

Glu Asn Asp Lys Leu Tyr Ile Asn Asp Lys Glu Thr Asp Glu Pro Tyr
65 70 75 80

Leu Ala Asp Tyr Ile Lys Arg Phe Lys Asp Asp Lys Leu Gln Ser Thr
85 90 95

Tyr Ser Gly Lys Gly Phe Glu Gly Asn Lys Gly Thr Phe Phe Arg Ser
100 105 110

Ile Ala Gln Lys Ala Gln Ala Phe Thr Val Asp Val Asn Tyr Asn Thr
115 120 125

Asn Phe Ser Phe Thr Val Pro Glu Gly Glu Tyr Leu Leu Leu Gly Asp
130 135 140

Asp Arg Leu Val Ser Ser Asp Ser Arg His Val Gly Thr Phe Lys Ala
145 150 155 160

Lys Asp Ile Thr Gly Glu Ala Lys Phe Arg Leu Trp Pro Ile Thr Arg
165 170 175

Ile Gly Thr Phe
180

(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

GGACTCTCTC AAAGATGTGA AAGCAAATGC TAGCGACAGC AAGCCTGCAC AGGACAAGAA 60

GGATGCAAAA CAAGGAACGG AAGATAGTAA GGATTCAGAT AAGATGACTG AAACAAACTC 120

AGTTCCGGCA GGAGTGATTG TGGTCAGTCT ACTTGCCCTC CTAGGCGTGA TTGCCTTCTG 180

GCTGATTCGC CGTAAGAAAG AGTCAGAAAT CCAGCAATTA AGCACGGAAT TGATCAAGGT 240

TCTAGGACAG CTAGATGCAG AAAAAGCGGA TAAAAAAGTC CTTGCCAAAG CCCAAAACCT 300

TCTCCAAGAA ACCCTTGATT TCGTGAAAGA AGAAAATGGC TCAGCAGAGA CAGAAACTAA 360

ACTAGTAGAG GAGCTTAAAG CAATCCTTGA CAAACTCAAG 400
(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 133 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Asp Ser Leu Lys Asp Val Lys Ala Asn Ala Ser Asp Ser Lys Pro Ala
1 5 10 15

Gln Asp Lys Lys Asp Ala Lys Gln Gly Thr Glu Asp Ser Lys Asp Ser
20 25 30

Asp Lys Met Thr Glu Thr Asn Ser Val Pro Ala Gly Val Ile Val Val
35 40 45

Ser Leu Leu Ala Leu Leu Gly Val Ile Ala Phe Trp Leu Ile Arg Arg
50 55 60

Lys Lys Glu Ser Glu Ile Gln Gln Leu Ser Thr Glu Leu Ile Lys Val
65 70 75 80

Leu Gly Gln Leu Asp Ala Glu Lys Ala Asp Lys Lys Val Leu Ala Lys
85 90 95

Ala Gln Asn Leu Leu Gln Glu Thr Leu Asp Phe Val Lys Glu Glu Asn
100 105 110

Gly Ser Ala Glu Thr Glu Thr Lys Leu Val Glu Glu Leu Lys Ala Ile
115 120 125

Leu Asp Lys Leu Lys
130

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1201 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



CAAGAAATCC TATCATCTCT TCCAGAAGCA [illegible] [illegible] AGACTCAGTT 60

GATTGAAGAA TCGCTTAGTC AGCAGACTAT [illegible] [illegible] AAACAGAATT 120

TATCCAAAGA TTGCGTGAGG CTCATGACAA [illegible] TATTCTCAGT CAGCCATCTT 180

TTATTCTTCA ACGGTCAATC CTTCGACTCG [illegible] GCACTCATTT ATGCCCTTTT 240

AGCTGGAGTA GGAGCTTATC GTATCATGAT [illegible] TTGACCGTCG GTCGTTTAGT 300

GACTTTTTTG AACTATGTTC AGCAATACAC [illegible] AACGATATTT CTTCAGTGCT 360

AGCTGAGTTG CAAAGTGCTC TGGCTTGCGT [illegible] TATGGAGTCT TAGATAGCCC 420

TGAAGTGGCT GAAACAGGTA AGGAAGTCTT GACGACCAGT GACCAAGTTA AGGGAGCTAT 480

TTCCTTTAAA CATGTCTCTT TTGGCTACCA TCCTGAAAAA ATTTTGATTA AGGACTTGTC 540

TATCGATATT CCAGCTGGTA GTAAGGTAGC CATCGTTGGT CCGACAGGTG CTGGAAAATC 600

AACTCTTATC AATCTCCTTA TGCGTTTTTA TCCCATTAGC TCGGGAGATA TCTTGCTGGA 660

TGGGCAATCC ATTTATGATT ATACACGAGT ATCATTGAGA CAGCAGTTTG GTATGGTGCT 720

TCAAGAAACC TGGCTCACAC AAGGGACCAT TCATGATAAT ATTGCCTTTG GCAATCCTGA 780

AGCCAGTCGA GAGCAAGTAA TTGCTGCTGC CAAAGCAGCT AATGCAGACT TTTTCATCCA 840

ACAGTTGCCA CAGGGATACG ATACCAAGTT GGAAAATGCT GGAGAATCTC TCTCTGTCGG 900

CCAAGCTCAG CTCTTGACCA TAGCCCGAGT CTTTCTGGCT ATTCCAAAGA TTCTTATCTT 960

AGACGAGGCA ACTTCTTCCA TTGATACACG GACAGAAGTG CTGGTACAGG ATGCCTTTGC 1020

AAAACTCATG AAGGGCCGCA CAAGTTTCAT CATTGCTCAC CGTTTGTCAA CCATTCAGGA 1080

TGCGGATTTA ATTCTTGTCT TAGTAGATGG TGATATTGTT GAATATGGTA ACCATCAAGA 1140

ACTCATGGAT AGAAAGGGTA AGTATTACCA AATGCAAAAA GCTGCGGCTT TTAGTTCTGA 1200

A

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

Lys Lys Ser Tyr His Leu Phe Gln Lys Gln Thr Glu Thr Arg Gly Ile
1 5 10 15

Gln Thr Gln Leu Ile Glu Glu Ser Leu Ser Gln Gln Thr Ile Ile Gln
20 25 30

Ser Phe Asn Ala Gln Thr Glu Phe Ile Gln Arg Leu Arg Glu Ala His
35 40 45

Asp Asn Tyr Ser Gly Tyr Ser Gln Ser Ala Ile Phe Tyr Ser Ser Thr
50 55 60

Val Asn Pro Ser Thr Arg Phe Val Asn Ala Leu Ile Tyr Ala Leu Leu
65 70 75 80

Ala Gly Val Gly Ala Tyr Arg Ile Met Met Gly Ser Ala Leu Thr Val
85 90 95

Gly Arg Leu Val Thr Phe Leu Asn Tyr Val Gln Gln Tyr Thr Lys Pro
100 105 110

Phe Asn Asp Ile Ser Ser Val Leu Ala Glu Leu Gln Ser Ala Leu Ala
115 120 125

Cys Val Glu Arg Ile Tyr Gly Val Leu Asp Ser Pro Glu Val Ala Glu
130 135 140

Thr Gly Lys Glu Val Leu Thr Thr Ser Asp Gln Val Lys Gly Ala Ile
145 150 155 160

Ser Phe Lys His Val Ser Phe Gly Tyr His Pro Glu Lys Ile Leu Ile
165 170 175

Lys Asp Leu Ser Ile Asp Ile Pro Ala Gly Ser Lys Val Ala Ile Val
180 185 190

Gly Pro Thr Gly Ala Gly Lys Ser Thr Leu Ile Asn Leu Leu Met Arg
195 200 205

Phe Tyr Pro Ile Ser Ser Gly Asp Ile Leu Leu Asp Gly Gln Ser Ile
210 215 220

Tyr Asp Tyr Thr Arg Val Ser Leu Arg Gln Gln Phe Gly Met Val Leu
225 230 235 240

Gln Glu Thr Trp Leu Thr Gln Gly Thr Ile His Asp Asn Ile Ala Phe
245 250 255

Gly Asn Pro Glu Ala Ser Arg Glu Gln Val Ile Ala Ala Ala Lys Ala
260 265 270

Ala Asn Ala Asp Phe Phe Ile Gln Gln Leu Pro Gln Gly Tyr Asp Thr
275 280 285

Lys Leu Glu Asn Ala Gly Glu Ser Leu Ser Val Gly Gln Ala Gln Leu
290 295 300

Leu Thr Ile Ala Arg Val Phe Leu Ala Ile Pro Lys Ile Leu Ile Leu
305 310 315 320

Asp Glu Ala Thr Ser Ser Ile Asp Thr Arg Thr Glu Val Leu Val Gln
325 330 335

Asp Ala Phe Ala Lys Leu Met Lys Gly Arg Thr Ser Phe Ile Ile Ala
340 345 350

His Arg Leu Ser Thr Ile Gln Asp Ala Asp Leu Ile Leu Val Leu Val
355 360 365

Asp Gly Asp Ile Val Glu Tyr Gly Asn His Gln Glu Leu Met Asp Arg
370 375 380

Lys Gly Lys Tyr Tyr Gln Met Gln Lys Ala Ala Ala Phe Ser Ser Glu
385 390 395 400

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



ACGAAATGCA GGGCAGACAG ATGCCTCGCA AATTGAAAAG GCGGCAGTTA GCCAAGGAGG 60

AAAAGCAGTG AAAAAAACAG AAATTAGTAA AGACGCAGAC TTGCACGAAA TTTATCTAGC 120

TGGAGGTTGT TTCTGGGGAG TGGAGGAATA TTTCTCACGT GTTCCCGGGG TGACGGATGC 180

CGTTTCAGGC TATGCAAATG GTAGAGGAGA AACAACCAAG TACGAATTGA TTAACCAAAC 240

AGGTCATGCA GAAACCGTCC ATGTCACCTA TGATGCCAAG CAAATTTCTC TCAAGGAAAT 300

CCTGCTTCAC TATTTCCGCA TTATCAATCC AACCAGCAAA AATAAACAAG GAAATGATGT 360

GGGGACCCAG TACCGTACTG GTGTTTATTA CACAGATGAC AAGGATTTGG AAGTGATTAA 420

CCAAGTCTTT GATGAGGTGG CTAAGAAATA CGATCAACCT CTAGCAGTTG AAAAGGAAAA 480

CTTGAAGAAT TTTGTGGTGG CTGAGGATTA CCATCAAGAC TATCTCAAGA AAAATCCAAA 540

TGGCTACTGC CATATCAATG TTAATCAGGC GGCCTATCCT GTCATTGATG CCAGCAAATA 600

TCCAAAACCA AGTGATGAGG AATTGAAAAA GACCCTGTCA CCTGAGGAGT ATGCAGTTAC 660

CCAGGAAAAT CAAACAGAAC GAGCTTTCTC AAACCGTTAC TGGGATAAAT TTGAATCCGG 720

TATCTATGTG GATATAGCAA CTGGGGAACC TCTCTTTTCA TCAAAAGACA AATTTGAGTC 780

TGGTTGTGGC TGGCCTAGTT TTACCCAACC CATCAGTCCA GATGTTGTCA CCTACAAGGA 840

AGATAAGTCC TACAATATGA CGCGTATGGA AGTGGGGAGC CGAGTAGGAG ATTCTCACCT 900

TGGGCATGTC TTTACGGATG GTCCACAGGA CAAGGGCGGC TTACGTTACT GTATCAATAG 960

CCTCTCTATC CGCTTTATTC CCAAAGACCA AATGGAAGAA AAAGGCTACG CTTATTTACT 1020

AGATTATGTT GAT

(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 344 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Arg Asn Ala Gly Gln Thr Asp Ala Ser Gln Ile Glu Lys Ala Ala Val
1 5 10 15

Ser Gln Gly Gly Lys Ala Val Lys Lys Thr Glu Ile Ser Lys Asp Ala
20 25 30

Asp Leu His Glu Ile Tyr Leu Ala Gly Gly Cys Phe Trp Gly Val Glu
35 40 45

Glu Tyr Phe Ser Arg Val Pro Gly Val Thr Asp Ala Val Ser Gly Tyr
50 55 60

Ala Asn Gly Arg Gly Glu Thr Thr Lys Tyr Glu Leu Ile Asn Gln Thr
65 70 75 80

Gly His Ala Glu Thr Val His Val Thr Tyr Asp Ala Lys Gln Ile Ser
85 90 95

Leu Lys Glu Ile Leu Leu His Tyr Phe Arg Ile Ile Asn Pro Thr Ser
100 105 110

Lys Asn Lys Gln Gly Asn Asp Val Gly Thr Gln Tyr Arg Thr Gly Val
115 120 125

Tyr Tyr Thr Asp Asp Lys Asp Leu Glu Val Ile Asn Gln Val Phe Asp
130 135 140

Glu Val Ala Lys Lys Tyr Asp Gln Pro Leu Ala Val Glu Lys Glu Asn
145 150 155 160

Leu Lys Asn Phe Val Val Ala Glu Asp Tyr His Gln Asp Tyr Leu Lys
165 170 175

Lys Asn Pro Asn Gly Tyr Cys His Ile Asn Val Asn Gln Ala Ala Tyr
180 185 190

Pro Val Ile Asp Ala Ser Lys Tyr Pro Lys Pro Ser Asp Glu Glu Leu
195 200 205

Lys Lys Thr Leu Ser Pro Glu Glu Tyr Ala Val Thr Gln Glu Asn Gln
210 215 220

Thr Glu Arg Ala Phe Ser Asn Arg Tyr Trp Asp Lys Phe Glu Ser Gly
225 230 235 240

Ile Tyr Val Asp Ile Ala Thr Gly Glu Pro Leu Phe Ser Ser Lys Asp
245 250 255

Lys Phe Glu Ser Gly Cys Gly Trp Pro Ser Phe Thr Gln Pro Ile Ser
260 265 270

Pro Asp Val Val Thr Tyr Lys Glu Asp Lys Ser Tyr Asn Met Thr Arg
275 280 285

Met Glu Val Arg Ser Arg Val Gly Asp Ser His Leu Gly His Val Phe
290 295 300

Thr Asp Gly Pro Gln Asp Lys Gly Gly Leu Arg Tyr Cys Ile Asn Ser
305 310 315 320

Leu Ser Ile Arg Phe Ile Pro Lys Asp Gln Met Glu Glu Lys Gly Tyr
325 330 335

Ala Tyr Leu Leu Asp Tyr Val Asp
340

(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 396 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TGTATAGTTT TTAGCGCTTG TTCTTCTAAT TCTGNTAAAA ATGAAGAAAA TACTTCTAAA 60

GAGCATGCGC CTGATAAAAT AGTTTTAGAT CATGCTTTCG GTCAAACTAT ATTAGATAAA 120

AAACCTGAAA GAGTTGCAAC TATTGCTTGG GGAAATCATG ATGTAGCATT AGCTTTAGGA 180

ATAGTTCCTG TTGGATTTTC AAAAGCAAAT TACGGTGTAA GTGCTGATAA AGGAGTTTTA 240

CCATGGACAG AAGAAAAAAT CAAAGAACTA AATGGTAAAG CTAACCTATT TGACGATTTG 300

GATGGACTTA ACTTTGAAGC AATATCAAAT TCTAAACCAG ATGTTATCTT AGCAGGTTAT 360

TCTGGTATAA CTAAAGAAGA TTATGACACT CTATCA 396

(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Cys Ile Val Phe Ser Ala Cys Ser Ser Asn Ser Xaa Lys Asn Glu Glu
1 5 10 15

Asn Thr Ser Lys Glu His Ala Pro Asp Lys Ile Val Leu Asp His Ala
20 25 30

Phe Gly Gln Thr Ile Leu Asp Lys Lys Pro Glu Arg Val Ala Thr Ile
35 40 45

Ala Trp Gly Asn His Asp Val Ala Leu Ala Leu Gly Ile Val Pro Val
50 55 60

Gly Phe Ser Lys Ala Asn Tyr Gly Val Ser Ala Asp Lys Gly Val Leu
65 70 75 80

Pro Trp Thr Glu Glu Lys Ile Lys Glu Leu Asn Gly Lys Ala Asn Leu
85 90 95

Phe Asp Asp Leu Asp Gly Leu Asn Phe Glu Ala Ile Ser Asn Ser Lys
100 105 110

Pro Asp Val Ile Leu Ala Gly Tyr Ser Gly Ile Thr Lys Glu Asp Tyr
115 120 125

Asp Thr Leu Ser
130

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
GTGTGTCGAG CATATTCTGA AGCAAACCTA TCAAAATATA GAAATTATTT TAGTTGATGA 60

CGGTTCTACG GATAATTCTG GGGAAATTTG TGATGCTTTT ATGATGCAAG ATAATCGTGT 120

GCGAGTATTG CATCAAGAAA ATAAGGGGGG GGCAGCACAA GCTAAAAATA TGGGGATTAG 180

TGTAGCTAAG GGAGAGTACA TCACGATTGT TGATTCAGAT GATATCGTAA AAGAAAATAT 240

GATTGAAACT CTTTATCAGC AAGTCCAAGA AAAGGATGCA GATGTTGTTA TAGGGAATTA 300

CTATAATTAT GACGAAAGTG ACGGGAATTT TTATTTTTAT GTAACAGGGC AAGATTTTTG 360

CGTCGAAGAA TTAGCTATAC AAGAAATTAT GAACCGTCAA GCAGGAGATT GGAAATTCAA 420

TAGCTCGGCC TTTATATTGC CGACATTTAA GTTGATTAAA AAAGAATTAT TCAATGAAGT 480

TCACTTTTCA AATGGTCGCC GCTTTGATGA TGAAGCAACT ATGCATCGCT TTTATCTTTT 540

AGCCTCTAAA ATCGTCTTTA TAAACGATAA TCTCTATCTG TATAGAAGAC GTTCAGGAAG 600

CATCATGAGA ACGGAATTTG ATCTTTCCTG GGCAAGAGAT ATTGTTGAAG TGTTTTCTAA 660

GAAAATATCG GATTGTGTCT TGGCTGGTTT GGATGTCTCC GTTCTGCGTA TTCGATTTGT 720

CAATCTTTTA AAAGATTATA AGCAAACTTT AGAATACCAT CAATTAACAG ATACTGAGGA 780

ATATAAAGAT ATTTGTTTCA GATTAAAGTT GTTTTTTGAT GCAGAACAAA GAAATGGTAA 840

AAGT

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 281 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Cys Val Glu His Ile Leu Lys Gln Thr Tyr Gln Asn Ile Glu Ile Ile
1 5 10 15

Leu Val Asp Asp Gly Ser Thr Asp Asn Ser Gly Glu Ile Cys Asp Ala
20 25 30

Phe Met Met Gln Asp Asn Arg Val Arg Val Leu His Gln Glu Asn Lys
35 40 45

Gly Gly Ala Ala Gln Ala Lys Asn Met Gly Ile Ser Val Ala Lys Gly
50 55 60

Glu Tyr Ile Thr Ile Val Asp Ser Asp Asp Ile Val Lys Glu Asn Met
65 70 75 80

Ile Glu Thr Leu Tyr Gln Gln Val Gln Glu Lys Asp Ala Asp Val Val
85 90 95

Ile Gly Asn Tyr Tyr Asn Tyr Asp Glu Ser Asp Gly Asn Phe Tyr Phe
100 105 110

Tyr Val Thr Gly Gln Asp Phe Cys Val Glu Glu Leu Ala Ile Gln Glu
115 120 125

Ile Met Asn Arg Gln Ala Gly Asp Trp Lys Phe Asn Ser Ser Ala Phe
130 135 140

Ile Leu Pro Thr Phe Lys Leu Ile Lys Lys Glu Leu Phe Asn Glu Val
145 150 155 160

His Phe Ser Asn Gly Arg Arg Phe Asp Asp Glu Ala Thr Met His Arg
165 170 175

Phe Tyr Leu Leu Ala Ser Lys Ile Val Phe Ile Asn Asp Asn Leu Tyr
180 185 190

Leu Tyr Arg Arg Arg Ser Gly Ser Ile Met Arg Thr Glu Phe Asp Leu
195 200 205

Ser Trp Ala Arg Asp Ile Val Glu Val Phe Ser Lys Lys Ile Ser Asp
210 215 220

Cys Val Leu Ala Gly Leu Asp Val Ser Val Leu Arg Ile Arg Phe Val
225 230 235 240

Asn Leu Leu Lys Asp Tyr Lys Gln Thr Leu Glu Tyr His Gln Leu Thr
245 250 255

Asp Thr Glu Glu Tyr Lys Asp Ile Cys Phe Arg Leu Lys Leu Phe Phe
260 265 270

Asp Ala Glu Gln Arg Asn Gly Lys Ser
275 280

(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



GTGTTTGGAT AGCATTCAGA ATCAGACGTA [illegible] [illegible] [illegible] 60

TGGCTCTCCA GATCATTCAT CCAAAATATG [illegible] [illegible] [illegible] 120

CAAATATTTT GAGAAAGCAA ACGGCGGTCT [illegible] [illegible] [illegible] 180

TTCGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC ATGATGCTTT 240

AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG GGCGTTATAA 300

TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG ATGATTCTCT 360

AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG TCAGAAATGG 420

GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG ATTTACCATT 480

TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC TAAGAGCTTC 540

GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT CTGATACTTT 600

ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG AAGAAAAGAT 660

AGCTATTTTA GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT ATAAAAATAG 720

ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG AGATTTACAG 780

AAGAATGATG GAAAAATTGT CTTTACTTCC G

(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Cys Leu Asp Ser Ile Gln Asn Gln Thr Tyr Gln Asn Phe Glu Cys Leu
1 5 10 15

Leu Ile Asn Asp Gly Ser Pro Asp His Ser Ser Lys Ile Cys Glu Glu
20 25 30

Phe Val Glu Lys Asp Ser Arg Phe Lys Tyr Phe Glu Lys Ala Asn Gly
35 40 45

Gly Leu Ser Ser Ala Arg Asn Leu Gly Ile Glu Cys Ser Gly Gly Ala
50 55 60

Tyr Ile Thr Phe Val Asp Ser Asp Asp Trp Leu Glu His Asp Ala Leu
65 70 75 80






Asp Arg Leu Tyr Gly Ala Leu Lys Lys Glu Asn Ala Asp Ile Ser Ile
85 90 95

Gly Arg Tyr Asn Ser Tyr Asp Glu Thr Arg Tyr Val Tyr Met Thr Tyr
100 105 110

Val Thr Asp Pro Asp Asp Ser Leu Glu Val Ile Glu Gly Lys Ala Ile
115 120 125

Met Asp Arg Glu Gly Val Glu Glu Val Arg Asn Gly Asn Trp Thr Val
130 135 140

Ala Val Leu Lys Leu Phe Lys Arg Glu Leu Leu Gln Asp Leu Pro Phe
145 150 155 160

Pro Ile Gly Lys Ile Ala Glu Asp Thr Tyr Trp Thr Trp Lys Val Leu
165 170 175

Leu Arg Ala Ser Arg Ile Val Tyr Leu Asn Arg Cys Val Tyr Trp Tyr
180 185 190

Arg Val Gly Leu Ser Asp Thr Leu Ser Asn Thr Trp Ser Glu Lys Arg
195 200 205

Met Tyr Asp Glu Ile Gly Ala Arg Glu Glu Lys Ile Ala Ile Leu Ala
210 215 220

Ser Ser Asp Tyr Asp Leu Thr Asn His Ile Leu Ile Tyr Lys Asn Arg
225 230 235 240

Leu Gln Arg Val Ile Ala Lys Leu Glu Glu Gln Asn Met Gln Phe Thr
245 250 255

Glu Ile Tyr Arg Arg Met Met Glu Lys Leu Ser Leu Leu Pro
260 265 270

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2023 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
GTGCCTAGAT AGTATTATTA CTCAAACATA TAAAAATATT GAGATTGTTG TCGTTAATGA 60

TGGTTCTACG GATGCTTCAG GTGAAATTTG TAAAGAATTT TCAGAAATGG ATCACCGAAT 120

TCTCTATATA GAACAAGAAA ATGCTGGTCT TTCTGCCGCA CGAAACACCG GTCTGAATAA 180

TATGTCCGGA AATTATGTGA CCTTTGTGGA CTCGGATGAT TGGATTGAGC AAGATTATGT 240

AGAAACTCTA TATAAAAAAA TAGTAGAGTA TCAGGCTGAT ATTGCAGTTG GTAATTATTA 300

TTCTTTCAAC GAAAGTGAAG GAATGTTCTA CTTTCATATA TTGGGAGACT CCTATTATGA 360

GAAAGTATAT GATAATGTTT CTATCTTTGA GAACTTGTAT GAAACTCAAG AAATGAAGAG 420

TTTTGCTTTG ATATCTGCTT GGGGTAAACT CTATAAGGCA AGATTGTTTG AGCAGTTGCG 480

CTTTGACATA GGTAAATTAG GAGAAGATGG TTACCTCAAT [illegible] [illegible] 540

AGAAAAGGTA ATTTATTTAA ATAAAAGTCT TTATGCTTAT [illegible] [illegible] 600

ATCAAGAGTT TGGACAGAAA AGTGGATGCA CGCTTTAGTT [illegible] [illegible] 660

TACGCTACTA GCTAATATGG GTTATCCTCT AGAGAAACAC TTGGCAGTTT [illegible] 720

GTTGGAAGTC AGTCTCGCCA ACGGTCAAGC TAGTGGTTTA TCTGACACAG [illegible] 780

AGAGTTTGAA ATGAAACAAA GGCTTTTAAA TCAGCTATCG AGACAAGAGG AAAGTGAAAA 840

GAAAGCCATT GTCCTCGCAG CAAACTATGG CTATGTAGAC CAAGTTTTAA CGACAATCAA 900

GTCTATTTGT TATCATAATC GTTCGATTCG TTTTTATCTG ATTCATAGCG ATTTTCCAAA 960

TGAATGGATT AAGCAATTAA ATAAGCGCTT AGAGAAGTTT GACTCAGAAA TTATTAATTG 1020

TCGGGTAACT TCTGAGCAAA TTTCATGTTA TAAATCGGAT ATTAGTTACA CAGTCTTTTT 1080

ACGCTATTTC ATAGCTGATT TCGTGCAAGA AGACAAGGCC CTCTACTTGG [illegible] 1140

AGTTGTAACG AAAAATCTGG ATGACTTGTT TGCTACAGAC TTACAAGATT [illegible] 1200

TGCTGTTAGA GATTTTGGGG GCAGAGCTTA TTTTGGTCAA GAAATCTTTA [illegible] 1260

TCTCTTGGTA AACAATGCTT TTTGGAAAAA AGAGAATATG ACCCAAAAAT TAATTGATGT 1320

AACCAATGAA TGGCATGATA AGGTGGATCA GGCAGATCAG AGCATCTTGA ATATGCTTTT 1380

TGAACATAAA TGGTTGGAAT TGGACTTTGA TTATAATCAT ATTGTCATTC ATAAACAGTT 1440

TGCTGATTAT CAATTGCCTG AGGGTCAGGA TTATCCTGCT ATTATTCACT ATCTTTCTCA 1500

TCGGAAACCG TGGAAAGATT TGGCGGCCCA AACCTATCGT GAAGTTTGGT GGTACTATCA 1560

TGGGCTTGAA TGGACAGAAT TGGGACAAAA CCATCATTTA CATCCATTAC [illegible] 1620

GATCTATCCA ATAAAGGAAC CTTTCACTTG TCTAATCTAT ACTGCCTCAG ACCATATTGA 1680

ACAAATTGAG ACATTGGTTC AATCCTTGCC TGATATTCAG TTTAAGATAG CAGCTAGAGT 1740

AATAGTTAGT GATCGATTGG CTCAGATGAC AATTTATCCA AACGTGACTA TATTTAACGG 1800

AATTCACTAT TTGGTAGATG TCGATAATGA ATTGGTAGAA ACCAGTCAAG TACTTTTAGA 1860

TATTAATCAT GGCGAAAAGA CAGAAGAAAT TCTCGATCAA TTTGCTAATC TTGGCAAGCC 1920

TATCTTATCC TTTGAAAATA CTAAAACCTA TGAAGTAGGT CAGGAGGCAT ATGCTGTTGA 1980

CCAAGTTCAA GCAATGATTG AAAAATTGAG AGAAATAAGC AAA

(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 674 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

Cys Leu Asp Ser Ile Ile Thr Gln Thr Tyr Lys Asn Ile Glu Ile Val
1 5 10 15

Val Val Asn Asp Gly Ser Thr Asp Ala Ser Gly Glu Ile Cys Lys Glu
20 25 30

Phe Ser Glu Met Asp His Arg Ile Leu Tyr Ile Glu Gln Glu Asn Ala
35 40 45

Gly Leu Ser Ala Ala Arg Asn Thr Gly Leu Asn Asn Met Ser Gly Asn
50 55 60

Tyr Val Thr Phe Val Asp Ser Asp Asp Trp Ile Glu Gln Asp Tyr Val
65 70 75 80

Glu Thr Leu Tyr Lys Lys Ile Val Glu Tyr Gln Ala Asp Ile Ala Val
85 90 95

Gly Asn Tyr Tyr Ser Phe Asn Glu Ser Glu Gly Met Phe Tyr Phe His
100 105 110

Ile Leu Gly Asp Ser Tyr Tyr Glu Lys Val Tyr Asp Asn Val Ser Ile
115 120 125

Phe Glu Asn Leu Tyr Glu Thr Gln Glu Met Lys Ser Phe Ala Leu Ile
130 135 140

Ser Ala Trp Gly Lys Leu Tyr Lys Ala Arg Leu Phe Glu Gln Leu Arg
145 150 155 160

Phe Asp Ile Gly Lys Leu Gly Glu Asp Gly Tyr Leu Asn Gln Lys Val
165 170 175

Tyr Leu Leu Ser Glu Lys Val Ile Tyr Leu Asn Lys Ser Leu Tyr Ala
180 185 190

Tyr Arg Ile Arg Lys Gly Ser Leu Ser Arg Val Trp Thr Glu Lys Trp
195 200 205

Met His Ala Leu Val Asp Ala Met Ser Glu Arg Ile Thr Leu Leu Ala
210 215 220

Asn Met Gly Tyr Pro Leu Glu Lys His Leu Ala Val Tyr Arg Gln Met
225 230 235 240

Leu Glu Val Ser Leu Ala Asn Gly Gln Ala Ser Gly Leu Ser Asp Thr
245 250 255

Ala Thr Tyr Lys Glu Phe Glu Met Lys Gln Arg Leu Leu Asn Gln Leu
260 265 270

Ser Arg Gln Glu Glu Ser Glu Lys Lys Ala Ile Val Leu Ala Ala Asn
275 280 285

Tyr Gly Tyr Val Asp Gln Val Leu Thr Thr Ile Lys Ser Ile Cys Tyr
290 295 300

His Asn Arg Ser Ile Arg Phe Tyr Leu Ile His Ser Asp Phe Pro Asn
305 310 315 320

Glu Trp Ile Lys Gln Leu Asn Lys Arg Leu Glu Lys Phe Asp Ser Glu
325 330 335



Ile Ile Asn Cys Arg Val Thr Ser Glu Gln Ile Ser Cys Tyr Lys Ser
340 345 350

Asp Ile Ser Tyr Thr Val Phe Leu Arg Tyr Phe Ile Ala Asp Phe Val
355 360 365

Gln Glu Asp Lys Ala Leu Tyr Leu Asp Cys Asp Leu Val Val Thr Lys
370 375 380

Asn Leu Asp Asp Leu Phe Ala Thr Asp Leu Gln Asp Tyr Pro Leu Ala
385 390 395 400

Ala Val Arg Asp Phe Gly Gly Arg Ala Tyr Phe Gly Gln Glu Ile Phe
405 410 415

Asn Ala Gly Val Leu Leu Val Asn Asn Ala Phe Trp Lys Lys Glu Asn
420 425 430

Met Thr Gln Lys Leu Ile Asp Val Thr Asn Glu Trp His Asp Lys Val
435 440 445

Asp Gln Ala Asp Gln Ser Ile Leu Asn Met Leu Phe Glu His Lys Trp
450 455 460

Leu Glu Leu Asp Phe Asp Tyr Asn His Ile Val Ile His Lys Gln Phe
465 470 475 480

Ala Asp Tyr Gln Leu Pro Glu Gly Gln Asp Tyr Pro Ala Ile Ile His
485 490 495

Tyr Leu Ser His Arg Lys Pro Trp Lys Asp Leu Ala Ala Gln Thr Tyr
500 505 510

Arg Glu Val Trp Trp Tyr Tyr His Gly Leu Glu Trp Thr Glu Leu Gly
515 520 525

Gln Asn His His Leu His Pro Leu Gln Arg Ser His Ile Tyr Pro Ile
530 535 540

Lys Glu Pro Phe Thr Cys Leu Ile Tyr Thr Ala Ser Asp His Ile Glu
545 550 555 560

Gln Ile Glu Thr Leu Val Gln Ser Leu Pro Asp Ile Gln Phe Lys Ile
565 570 575

Ala Ala Arg Val Ile Val Ser Asp Arg Leu Ala Gln Met Thr Ile Tyr
580 585 590

Pro Asn Val Thr Ile Phe Asn Gly Ile His Tyr Leu Val Asp Val Asp
595 600 605

Asn Glu Leu Val Glu Thr Ser Gln Val Leu Leu Asp Ile Asn His Gly
610 615 620

Glu Lys Thr Glu Glu Ile Leu Asp Gln Phe Ala Asn Leu Gly Lys Pro
625 630 635 640

Ile Leu Ser Phe Glu Asn Thr Lys Thr Tyr Glu Val Gly Gln Glu Ala
645 650 655

Tyr Ala Val Asp Gln Val Gln Ala Met Ile Glu Lys Leu Arg Glu Ile
660 665 670

Ser Lys

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT GTTGATGATG GTGCAACAGA 60

TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT GACAGGGTGT CAGTGCTTCA 120

TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG ATGAAGCAGG CTCACGGGGA 180

TTATCTGATT TTTATTGACT CAGATGATTA TATCCATCCA GAAATGATTC AGAGCTTATA 240

TGAGCAATTA GTTCAAGAAG ATGCGGATGT TTCGAGCTGT GGTGTCATGA ATGTCTATGC 300

TAATGATGAA AGCCCACAGT CAGCCAATCA GGATGACTAT TTTGTCTGTG ATTCTCAAAC 360

ATTTCTAAAG GAATACCTCA TAGGTGAAAA AATACCTGGG ACGATTTGCA ATAAGCTAAT 420

CAAGAGACAG ATTGCAACTG CCCTATCCTT TCCTAAGGGG TTGATTTACG AAGATGCCTA 480

TTACCATTTT GATTTAATCA AGTTGGCCAA GAAGTATGTG GTTAATACTA AACCCTATTA 540

TTACTATTTC CATAGAGGGG ATAGTATTAC GACCAAACCC TATGCAGAGA AGGATTTAGC 600

CTATATTGAT ATCTACCAAA AGTTTTATAA TGAAGTTGTG AAAAACTATC CTGACTTGAA 660

AGAGGTCGCT TTTTTCAGAT TGGCCTATGC CCACTTCTTT ATTCTGGATA AGATGTTGCT 720

AGATGATCAG TATAAACAGT TTGAAGCCTA TTCTCAGATT CATCGTTTTT TAAAAGGCCA 780

TGCCTTTGCT ATTTCTAGGA ATCCAATTTT CCGTAAGGGG AGAAGAATTA GTGCTTTGGC 840

CCTATTCATA AATATTTCCT TATATCGATT CTTATTACTG AAAAATATTG AAAAATCTAA 900

AAAATTACAT

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 303 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:



Ile Gln Lys Gln Thr Tyr Gln Asn Leu Glu Ile Ile Leu Val Asp Asp
1 5 10 15

Gly Ala Thr Asp Glu Ser Gly Arg Leu Cys Asp Ser Ile Ala Glu Gln
20 25 30

Asp Asp Arg Val Ser Val Leu His Lys Lys Asn Glu Gly Leu Ser Gln
35 40 45

Ala Arg Asn Asp Gly Met Lys Gln Ala His Gly Asp Tyr Leu Ile Phe
50 55 60

Ile Asp Ser Asp Asp Tyr Ile His Pro Glu Met Ile Gln Ser Leu Tyr
65 70 75 80

Glu Gln Leu Val Gln Glu Asp Ala Asp Val Ser Ser Cys Gly Val Met
85 90 95

Asn Val Tyr Ala Asn Asp Glu Ser Pro Gln Ser Ala Asn Gln Asp Asp
100 105 110

Tyr Phe Val Cys Asp Ser Gln Thr Phe Leu Lys Glu Tyr Leu Ile Gly
115 120 125

Glu Lys Ile Pro Gly Thr Ile Cys Asn Lys Leu Ile Lys Arg Gln Ile
130 135 140

Ala Thr Ala Leu Ser Phe Pro Lys Gly Leu Ile Tyr Glu Asp Ala Tyr
145 150 155 160

Tyr His Phe Asp Leu Ile Lys Leu Ala Lys Lys Tyr Val Val Asn Thr
165 170 175

Lys Pro Tyr Tyr Tyr Tyr Phe His Arg Gly Asp Ser Ile Thr Thr Lys
180 185 190

Pro Tyr Ala Glu Lys Asp Leu Ala Tyr Ile Asp Ile Tyr Gln Lys Phe
195 200 205

Tyr Asn Glu Val Val Lys Asn Tyr Pro Asp Leu Lys Glu Val Ala Phe
210 215 220

Phe Arg Leu Ala Tyr Ala His Phe Phe Ile Leu Asp Lys Met Leu Leu
225 230 235 240

Asp Asp Gln Tyr Lys Gln Phe Glu Ala Tyr Ser Gln Ile His Arg Phe
245 250 255

Leu Lys Gly His Ala Phe Ala Ile Ser Arg Asn Pro Ile Phe Arg Lys
260 265 270

Gly Arg Arg Ile Ser Ala Leu Ala Leu Phe Ile Asn Ile Ser Leu Tyr
275 280 285

Arg Phe Leu Leu Leu Lys Asn Ile Glu Lys Ser Lys Lys Leu His
290 295 300

(2) INFORMATION FOR SEQ ID NO: 203:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1972 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

TAAGGCTGAT AATCGTGTTC AAATGAGAAC GACGATTAAT AATGAATCGC CATTGTTGCT 60

TTCTCCGTTG TATGGCAATG ATAATGGTAA CGGATTATGG TGGGGGAACA CATTGAAGGG 120

AGCATGGGAA GCTATTCCTG AAGATGTAAA GCCATATGCA GCGATTGAAC TTCATCCTGC 180

AAAAGTCTGT AAACCAACAA GTTGTATTCC ACGAGATACG AAAGAATTGA GAGAATGGTA 240

TGTCAAGATG TTGGAGGAAG CTCAAAGTCT AAACATTCCA GTTTTCTTGG TTATTATGTC 300

GGCTGGAGAG CGTAATACAG TTCCTCCAGA GTGGTTAGAT GAACAATTCC AAAAGTATAG 360

TGTGTTAAAA GGTGTTTTAA ATATTGAGAA TTATTGGATT TACAATAACC AGTTAGCTCC 420

GCATAGTGCT AAATATTTGG AAGTTTGTGC CAAATATGGA GCGCATTTTA TCTGGCATGA 480

TCATGAAAAA TGGTTCTGGG AAACTATTAT GAATGATCCG ACATTCTTTG AAGCGAGTCA 540

AAAATATCAT AAAAATTTGG TGTTGGCAAC TAAAAATACG CCAATAAGAG ATGATGCGGG 600

TACAGATTCT ATCGTTAGTG GATTTTGGTT GAGTGGCTTA TGTGATAACT GGGGCTCATC 660

AACAGATACA TGGAAATGGT GGGAAAAACA TTATACAAAC ACATTTGAAA CTGGAAGAGC 720

TAGGGATATG AGATCCTATG CATCGGAACC AGAATCAATG ATTGCTATGG AAATGATGAA 780

TGTATATACT GGGGGAGGCA CAGTTTATAA TTTCGAATGT GCCGCGTATA CATTTATGAC 840

AAATGATGTA CCAACTCCAG CATTTACTAA AGGTATTATT CCTTTCTTTA GACATGCTAT 900

ACAAAATCCA GCTCCAAGTA AGGAAGAAGT TGTAAATAGA ACAAAAGCTG TATTTTGGAA 960

TGGAGAAGGT AGGATTAGTT CATTAAACGG ATTTTATCAA GGACTTTATT CGAATGATGA 1020

AACAATGCCT TTATATAATA ATGGGAGATA TCATATTCTT CCTGTAATAC ATGAGAAAAT 1080

TGATAAGGAA AAGATTTCAT CTATATTCCC TAATGCAAAA ATTTTGACTA AAAATAGTGA 1140

GGAATTGTCT AGTAAAGTCA ACTATTTAAA CTCGCTTTAT CCAAAACTTT ATGAAGGAGA 1200

TGGGTATGCT CAGCGTGTAG GTAATTCCTG GTATATTTAT AATAGTAATG CTAATATCAA 1260

TAAAAATCAG CAAGTAATGT TGCCTATGTA TACTAATAAT ACAAAGTCGT TATCGTTAGA 1320

TTTGACGCCA CATACTTACG CTGTTGTTAA AGAAAATCCA AATAATTTAC ATATTTTATT 1380

GAATAATTAC AGGACAGATA AGACAGCTAT GTGGGCATTA TCAGGAAATT TTGATGCATC 1440

AAAAAGTTGG AAGAAAGAAG AATTAGAGTT AGCGAACTGG ATAAGCAAAA ATTATTCCAT 1500

CAATCCTGTA GATAATGACT TTAGGACAAC AACACTTACA TTAAAAGGGC ATACTGGTCA 1560

TAAACCTCAG ATAAATATAA GTGGCGATAA AAATCATTAT ACTTATACAG AAAATTGGGA 1620

TGAGAATACC CATGTTTATA CCATTACGGT TAATCATAAT GGAATGGTAG AGATGTCTAT 1680

AAATACTGAG GGGACAGGTC CAGTCTCTTT CCCAACACCA GATAAATTTA ATGATGGTAA 1740

TTTGAATATA GCATATGCAA AACCAACAAC ACAAAGTTCT GTAGATTACA ATGGAGACCC 1800

TAATAGAGCT GTGGATGGTA ACAGAAATGG TAATTTTAAC TCTGGTTCGG TAACACACAC 1860

TAGGGCAGAT AATCCCTCTT GGTGGGAAGT CGATTTGAAA AAAATGGATA AAGTTGGGCT 1920

TGTTAAAATT TATAATCGCA CAGATGCTGA GACTCAACGT CTATCTAATT TT 1972

(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

Lys Ala Asp Asn Arg Val Gln Met Arg Thr Thr Ile Asn Asn Glu Ser
1 5 10 15

Pro Leu Leu Leu Ser Pro Leu Tyr Gly Asn Asp Asn Gly Asn Gly Leu
20 25 30

Trp Trp Gly Asn Thr Leu Lys Gly Ala Trp Glu Ala Ile Pro Glu Asp
35 40 45

Val Lys Pro Tyr Ala Ala Ile Glu Leu His Pro Ala Lys Val Cys Lys
50 55 60

Pro Thr Ser Cys Ile Pro Arg Asp Thr Lys Glu Leu Arg Glu Trp Tyr
65 70 75 80

Val Lys Met Leu Glu Glu Ala Gln Ser Leu Asn Ile Pro Val Phe Leu
85 90 95

Val Ile Met Ser Ala Gly Glu Arg Asn Thr Val Pro Pro Glu Trp Leu
100 105 110

Asp Glu Gln Phe Gln Lys Tyr Ser Val Leu Lys Gly Val Leu Asn Ile
115 120 125

Glu Asn Tyr Trp Ile Tyr Asn Asn Gln Leu Ala Pro His Ser Ala Lys
130 135 140

Tyr Leu Glu Val Cys Ala Lys Tyr Gly Ala His Phe Ile Trp His Asp
145 150 155 160

His Glu Lys Trp Phe Trp Glu Thr Ile Met Asn Asp Pro Thr Phe Phe
165 170 175

Glu Ala Ser Gln Lys Tyr His Lys Asn Leu Val Leu Ala Thr Lys Asn
180 185 190

Thr Pro Ile Arg Asp Asp Ala Gly Thr Asp Ser Ile Val Ser Gly Phe
195 200 205

Trp Leu Ser Gly Leu Cys Asp Asn Trp Gly Ser Ser Thr Asp Thr Trp
210 215 220

Lys Trp Trp Glu Lys His Tyr Thr Asn Thr Phe Glu Thr Gly Arg Ala
225 230 235 240

Arg Asp Met Arg Ser Tyr Ala Ser Glu Pro Glu Ser Met Ile Ala Met
245 250 255

Glu Met Met Asn Val Tyr Thr Gly Gly Gly Thr Val Tyr Asn Phe Glu
260 265 270

Cys Ala Ala Tyr Thr Phe Met Thr Asn Asp Val Pro Thr Pro Ala Phe 
275 280 285 

Thr Lys Gly Ile Ile Pro Phe Phe Arg His Ala Ile Gln Asn Pro Ala
290 295 300

Pro Ser Lys Glu Glu Val Val Asn Arg Thr Lys Ala Val Phe Trp Asn
305 310 315 320

Gly Glu Gly Arg Ile Ser Ser Leu Asn Gly Phe Tyr Gln Gly Leu Tyr
325 330 335

Ser Asn Asp Glu Thr Met Pro Leu Tyr Asn Asn Gly Arg Tyr His Ile
340 345 350

Leu Pro Val Ile His Glu Lys Ile Asp Lys Glu Lys Ile Ser Ser Ile
355 360 365

Phe Pro Asn Ala Lys Ile Leu Thr Lys Asn Ser Glu Glu Leu Ser Ser
370 375 380

Lys Val Asn Tyr Leu Asn Ser Leu Tyr Pro Lys Leu Tyr Glu Gly Asp
385 390 395 400

Gly Tyr Ala Gln Arg Val Gly Asn Ser Trp Tyr Ile Tyr Asn Ser Asn
405 410 415

Ala Asn Ile Asn Lys Asn Gln Gln Val Met Leu Pro Met Tyr Thr Asn
420 425 430

Asn Thr Lys Ser Leu Ser Leu Asp Leu Thr Pro His Thr Tyr Ala Val
435 440 445

Val Lys Glu Asn Pro Asn Asn Leu His Ile Leu Leu Asn Asn Tyr Arg
450 455 460

Thr Asp Lys Thr Ala Met Trp Ala Leu Ser Gly Asn Phe Asp Ala Ser
465 470 475 480

Lys Ser Trp Lys Lys Glu Glu Leu Glu Leu Ala Asn Trp Ile Ser Lys
485 490 495

Asn Tyr Ser Ile Asn Pro Val Asp Asn Asp Phe Arg Thr Thr Thr Leu
500 505 510

Thr Leu Lys Gly His Thr Gly His Lys Pro Gln Ile Asn Ile Ser Gly
515 520 525

Asp Lys Asn His Tyr Thr Tyr Thr Glu Asn Trp Asp Glu Asn Thr His
530 535 540

Val Tyr Thr Ile Thr Val Asn His Asn Gly Met Val Glu Met Ser Ile
545 550 555 560

Asn Thr Glu Gly Thr Gly Pro Val Ser Phe Pro Thr Pro Asp Lys Phe
565 570 575

Asn Asp Gly Asn Leu Asn Ile Ala Tyr Ala Lys Pro Thr Thr Gln Ser
580 585 590

Ser Val Asp Tyr Asn Gly Asp Pro Asn Arg Ala Val Asp Gly Asn Arg
595 600 605

Asn Gly Asn Phe Asn Ser Gly Ser Val Thr His Thr Arg Ala Asp Asn
610 615 620

Pro Ser Trp Trp Glu Val Asp Leu Lys Lys Met Asp Lys Val Gly Leu
625 630 635 640

Val Lys Ile Tyr Asn Arg Thr Asp Ala Glu Thr Gln Arg Leu Ser Asn
645 650 655 

Phe 

(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

CTGTGGCAAT CAGTCAGCTG CTTCCAAACA GTCAGCTTCA GGAACGATTG AGGTGATTTC 60

ACGAGAAAAT GGCTCTGGGA CACGGGGTGC CTTCACAGAA ATCACAGGGA TTCTCAAAAA 120

AGACGGTGAT AAAAAAATTG ACAACACTGC CAAAACAGCT GTGATTCAAA ATAGTACAGA 180

AGGTGTTCTC TCAGCAGTTC AAGGGAATGC TAATGCTATC GGCTACATCT CCTTGGGATC 240

TTTAACGAAA TCTGTCAAGG CTTTAGAGAT TGATGGTGTC AAGGCTAGTC GAGACACAGT 300

TTTAGATGGT GAATACCCTC TTCAACGTCC CTTCAACATT GTTTGGTCTT CTAATCTTTC 360

CAAGCTAGGT CAAGATTTTA TCAGCTTTAT CCACTCCAAA CAAGGTCAAC AAGTGGTCAC 420

AGATAATAAA TTTATTGAAG CTAAAACCGA AACCACGGAA TATACAAGCC AACACTTATC 480

AGGCAAGTTG TCTGTTGTAG GTTCCACTTC AGTATCTTCT TTAATGGAAA AATTAGCAGA 540

AGCTTATAAA AAAGAAAATC CAGAAGTTAC GATTGATATT ACCTCTAATG GGTCTTCAGC 600

AGGTATTACC GCTGTTAAGG AGAAAACCGC TGATATTGGT ATGGTTTCTA GGGAATTAAC 660

TCCTGAAGAA GGTAAGAGTC TCACCCATGA TGCTATTGCT TTAGACGGTA TTGCTGTTGT 720

GGTCAATAAT GACAATAAGG CAAGCCAAGT CAGTATGGCT GAACTTGCAG ACGTTTTTAG 780

TGGCAAATTA ACCACCTGGG ACAAGATTAA A 811

(2) INFORMATION FOR SEQ ID NO: 206:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

Cys Gly Asn Gln Ser Ala Ala Ser Lys Gln Ser Ala Ser Gly Thr Ile
1 5 10 15

Glu Val Ile Ser Arg Glu Asn Gly Ser Gly Thr Arg Gly Ala Phe Thr
20 25 30

Glu Ile Thr Gly Ile Leu Lys Lys Asp Gly Asp Lys Lys Ile Asp Asn
35 40 45

Thr Ala Lys Thr Ala Val Ile Gln Asn Ser Thr Glu Gly Val Leu Ser
50 55 60

Ala Val Gln Gly Asn Ala Asn Ala Ile Gly Tyr Ile Ser Leu Gly Ser
65 70 75 80

Leu Thr Lys Ser Val Lys Ala Leu Glu Ile Asp Gly Val Lys Ala Ser
85 90 95

Arg Asp Thr Val Leu Asp Gly Glu Tyr Pro Leu Gln Arg Pro Phe Asn
100 105 110

Ile Val Trp Ser Ser Asn Leu Ser Lys Leu Gly Gln Asp Phe Ile Ser
115 120 125

Phe Ile His Ser Lys Gln Gly Gln Gln Val Val Thr Asp Asn Lys Phe
130 135 140

Ile Glu Ala Lys Thr Glu Thr Thr Glu Tyr Thr Ser Gln His Leu Ser
145 150 155 160

Gly Lys Leu Ser Val Val Gly Ser Thr Ser Val Ser Ser Leu Met Glu
165 170 175

Lys Leu Ala Glu Ala Tyr Lys Lys Glu Asn Pro Glu Val Thr Ile Asp
180 185 190

Ile Thr Ser Asn Gly Ser Ser Ala Gly Ile Thr Ala Val Lys Glu Lys
195 200 205

Thr Ala Asp Ile Gly Met Val Ser Arg Glu Leu Thr Pro Glu Glu Gly
210 215 220

Lys Ser Leu Thr His Asp Ala Ile Ala Leu Asp Gly Ile Ala Val Val
225 230 235 240

Val Asn Asn Asp Asn Lys Ala Ser Gln Val Ser Met Ala Glu Leu Ala
245 250 255

Asp Val Phe Ser Gly Lys Leu Thr Thr Trp Asp Lys Ile Lys
260 265 270

(2) INFORMATION FOR SEQ ID NO: 207:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 805 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:



TTGTCAACAA CAACATGCTA CTTCTGAGGG GACGAATCAA AGGCAAAGCA GTTCAGCGAA 60

AGTTCCATGG AAAGCTTCAT ACACCAACCT AAACAACCAG GTAAGTACAG AAGAGGTCAA 120

ATCTCTCTTA TCAGCTCACT TGGATCCAAA TAGTGTTGAT GCATTTTTTA ATCTCGTTAA 180

TGACTATAAT ACCATTGTCG GCTCAACTGG CTTATCAGGA GATTTCACTT CCTTTACTCA 240

CACCGAATAC GATGTTGAGA AAATCAGTCA TCTCTGGAAT CAAAAGAAGG GCGATTTTGT 300

TGGGACCAAC TGCCGTATCA ATAGTTATTG TCTTTTGAAA AATTCAGTCA CCATTCCAAA 360

GCTTGAAAAG AATGACCAGT TGCTTTTCCT AGATAATGAT GCGATTGATA AAGGAAAGGT 420

CTTTGATTCA CAAGATAAGG AAGAGTTTGA TATTCTATTT TCGAGAGTTC CAACTGAGTC 480

AACTACAGAT GTCAAGGTTC ACGCTGAAAA GATGGAAGCA TTCTTCTCAC AATTTCAATT 540

CAATGAAAAA GCTCGAATGC TGTCTGTAGT CTTGCACGAC AATTTGGATG GCGAGTATCT 600

GTTTGTAGGC CACGTTGGGG TCTTAGTACC TGCTGATGAC GGTTTCTTAT TTGTAGAGAA 660

ATTGACTTTC GAAGAGCCCT ACCAAGCGAT TAAATTTGCT AGTAAGGAAG ATTGCTACAA 720

GTATTTGGGC ACCAAGTATG CGGATTATAC AGGCGAGGGA CTGGCTAAGC CTTTTATCAT 780

GGATAATGAT AAGTGGGTTA AACTT 805



(2) INFORMATION FOR SEQ ID NO: 208:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 268 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:

Cys Gln Gln Gln His Ala Thr Ser Glu Gly Thr Asn Gln Arg Gln Ser
1 5 10 15

Ser Ser Ala Lys Val Pro Trp Lys Ala Ser Tyr Thr Asn Leu Asn Asn
20 25 30

Gln Val Ser Thr Glu Glu Val Lys Ser Leu Leu Ser Ala His Leu Asp
35 40 45

Pro Asn Ser Val Asp Ala Phe Phe Asn Leu Val Asn Asp Tyr Asn Thr
50 55 60

Ile Val Gly Ser Thr Gly Leu Ser Gly Asp Phe Thr Ser Phe Thr His
65 70 75 80

Thr Glu Tyr Asp Val Glu Lys Ile Ser His Leu Trp Asn Gln Lys Lys
85 90 95

Gly Asp Phe Val Gly Thr Asn Cys Arg Ile Asn Ser Tyr Cys Leu Leu
100 105 110

Lys Asn Ser Val Thr Ile Pro Lys Leu Glu Lys Asn Asp Gln Leu Leu
115 120 125

Phe Leu Asp Asn Asp Ala Ile Asp Lys Gly Lys Val Phe Asp Ser Gln
130 135 140

Asp Lys Glu Glu Phe Asp Ile Leu Phe Ser Arg Val Pro Thr Glu Ser
145 150 155 160

Thr Thr Asp Val Lys Val His Ala Glu Lys Met Glu Ala Phe Phe Ser
165 170 175

Gln Phe Gln Phe Asn Glu Lys Ala Arg Met Leu Ser Val Val Leu His
180 185 190

Asp Asn Leu Asp Gly Glu Tyr Leu Phe Val Gly His Val Gly Val Leu
195 200 205

Val Pro Ala Asp Asp Gly Phe Leu Phe Val Glu Lys Leu Thr Phe Glu
210 215 220

Glu Pro Tyr Gln Ala Ile Lys Phe Ala Ser Lys Glu Asp Cys Tyr Lys
225 230 235 240

Tyr Leu Gly Thr Lys Tyr Ala Asp Tyr Thr Gly Glu Gly Leu Ala Lys
245 250 255

Pro Phe Ile Met Asp Asn Asp Lys Trp Val Lys Leu
260 265

(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TTGTTCAGGC AAGTCCGTGA CTAGTGAACA CCAAACGAAA GATGAAATGA AGACGGAGCA 60

GACAGCTAGT AAAACAAGCG CAGCTAAAGG GAAAGAGGTG GCTGATTTTG AATTGATGGG 120

AGTAGATGGC AAGACCTACC GTTTATCTGA TTACAAGGGC AAGAAAGTCT ATCTCAAATT 180

CTGGGCTTCT TGGTGTTCCA TCTGTCTGGC TAGTCTTCCA GATACGGATG AGATTGCTAA 240

AGAAGCTGGT GATGACTATG TGGTCTTGAC AGTAGTGTCA CCAGGACATA AGGGAGAGCA 300

ATCTGAAGCG GACTTTAAGA ATTGGTATAA GGGATTGGAT TATAAAAATC TCCCAGTCCT 360

AGTTGACCCA TCAGGCAAAC TTTTGGAAAC TTATGGTGTC CGTTCTTACC CAACCCAAGC 420

CTTTATAGAC AAAGAAGGCA AGCTGGTCAA AACACATCCA GGATTCATGG AAAAAGATGC 480

AATTTTGCAA ACTTTGAAGG AATTAGCC 508

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

Cys Ser Gly Lys Ser Val Thr Ser Glu His Gln Thr Lys Asp Glu Met
1 5 10 15

Lys Thr Glu Gln Thr Ala Ser Lys Thr Ser Ala Ala Lys Gly Lys Glu
20 25 30

Val Ala Asp Phe Glu Leu Met Gly Val Asp Gly Lys Thr Tyr Arg Leu
35 40 45

Ser Asp Tyr Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp
50 55 60

Cys Ser Ile Cys Leu Ala Ser Leu Pro Asp Thr Asp Glu Ile Ala Lys
65 70 75 80

Glu Ala Gly Asp Asp Tyr Val Val Leu Thr Val Val Ser Pro Gly His
85 90 95

Lys Gly Glu Gln Ser Glu Ala Asp Phe Lys Asn Trp Tyr Lys Gly Leu
100 105 110

Asp Tyr Lys Asn Leu Pro Val Leu Val Asp Pro Ser Gly Lys Leu Leu
115 120 125

Glu Thr Tyr Gly Val Arg Ser Tyr Pro Thr Gln Ala Phe Ile Asp Lys
130 135 140

Glu Gly Lys Leu Val Lys Thr His Pro Gly Phe Met Glu Lys Asp Ala
145 150 155 160

Ile Leu Gln Thr Leu Lys Glu Leu Ala
165

(2) INFORMATION FOR SEQ ID NO: 211:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 994 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CTCGCAAATT GAAAAGGCGG CAGTTAGCCA AGGAGGAAAA GCAGTGAAAA AAACAGAAAT 60

TAGTAAAGAC GCAGACTTGC ACGAAATTTA TCTAGCTGGA GGTTGTTTCT GGGGAGTGGA 120

GGAATATTTC TCACGTGTTC CCGGGGTGAC GGATGCCGTT TCAGGCTATG CAAATGGTAG 180

AGGAGAAACA ACCAAGTACG AATTGATTAA CCAAACAGGT CATGCAGAAA CCGTCCATGT 240

CACCTATGAT GCCAAGCAAA TTTCTCTCAA GGAAATCCTG CTTCACTATT TCCGCATTAT 300

CAATCCAACC AGCAAAAATA AACAAGGAAA TGATGTGGGG ACCCAGTACC GTACTGGTGT 360

TTATTACACA GATGACAAGG ATTTGGAAGT GATTAACCAA GTCTTTGATG AGGTGGCTAA 420

GAAATACGAT CAACCTCTAG CAGTTGAAAA GGAAAACTTG AAGAATTTTG TGGTGGCTGA 480

GGATTACCAT CAAGACTATC TCAAGAAAAA TCCAAATGGC TACTGCCATA TCAATGTTAA 540

TCAGGCGGCC TATCCTGTCA TTGATGCCAG CAAATATCCA AAACCAAGTG ATGAGGAATT 600

GAAAAAGACC CTGTCACCTG AGGAGTATGC AGTTACCCAG GAAAATCAAA CAGAACGAGC 660

TTTCTCAAAC CGTTACTGGG ATAAATTTGA ATCCGGTATC TATGTGGATA TAGCAACTGG 720

GGAACCTCTC TTTTCATCAA AAGACAAATT TGAGTCTGGT TGTGGCTGGC CTAGTTTTAC 780

CCAACCCATC AGTCCAGATG TTGTCACCTA CAAGGAAGAT AAGTCCTACA ATATGACGCG 840

TATGGAAGTG CGGAGCCGAG TAGGAGATTC TCACCTTGGG CATGTCTTTA CGGATGGTCC 900

ACAGGACAAG GGCGGCTTAC GTTACTGTAT CAATAGCCTC TCTATCCGCT TTATTCCCAA 960

AGACCAAATG GAAGAAAAAG GTACGCTTAT TTAC 994

(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

Ser Gln Ile Glu Lys Ala Ala Val Ser Gln Gly Gly Lys Ala Val Lys
1 5 10 15

Lys Thr Glu Ile Ser Lys Asp Ala Asp Leu His Glu Ile Tyr Leu Ala
20 25 30

Gly Gly Cys Phe Trp Gly Val Glu Glu Tyr Phe Ser Arg Val Pro Gly
35 40 45

Val Thr Asp Ala Val Ser Gly Tyr Ala Asn Gly Arg Gly Glu Thr Thr
50 55 60

Lys Tyr Glu Leu Ile Asn Gln Thr Gly His Ala Glu Thr Val His Val
65 70 75 80

Thr Tyr Asp Ala Lys Gln Ile Ser Leu Lys Glu Ile Leu Leu His Tyr
85 90 95

Phe Arg Ile Ile Asn Pro Thr Ser Lys Asn Lys Gln Gly Asn Asp Val
100 105 110

Gly Thr Gln Tyr Arg Thr Gly Val Tyr Tyr Thr Asp Asp Lys Asp Leu
115 120 125

Glu Val Ile Asn Gln Val Phe Asp Glu Val Ala Lys Lys Tyr Asp Gln
130 135 140

Pro Leu Ala Val Glu Lys Glu Asn Leu Lys Asn Phe Val Val Ala Glu
145 150 155 160

Asp Tyr His Gln Asp Tyr Leu Lys Lys Asn Pro Asn Gly Tyr Cys His
165 170 175

Ile Asn Val Asn Gln Ala Ala Tyr Pro Val Ile Asp Ala Ser Lys Tyr
180 185 190

Pro Lys Pro Ser Asp Glu Glu Leu Lys Lys Thr Leu Ser Pro Glu Glu
195 200 205

Tyr Ala Val Thr Gln Glu Asn Gln Thr Glu Arg Ala Phe Ser Asn Arg
210 215 220

Tyr Trp Asp Lys Phe Glu Ser Gly Ile Tyr Val Asp Ile Ala Thr Gly
225 230 235 240

Glu Pro Leu Phe Ser Ser Lys Asp Lys Phe Glu Ser Gly Cys Gly Trp
245 250 255

Pro Ser Phe Thr Gln Pro Ile Ser Pro Asp Val Val Thr Tyr Lys Glu
260 265 270

Asp Lys Ser Tyr Asn Met Thr Arg Met Glu Val Arg Ser Arg Val Gly
275 280 285

Asp Ser His Leu Gly His Val Phe Thr Asp Gly Pro Gln Asp Lys Gly
290 295 300

Gly Leu Arg Tyr Cys Ile Asn Ser Leu Ser Ile Arg Phe Ile Pro Lys
305 310 315 320

Asp Gln Met Glu Glu Lys Gly Thr Leu Ile Tyr
325 330

(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 625 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TTGTCAGTCA GGTTCTAATG GTTCTCAGTC TGCTGTGGAT GCTATCAAAC AAAAAGGGAA 60

ATTAGTTGTG GCAACCAGTC CTGACTATGC ACCCTTTGAA TTTCAATCAT TGGTTGATGG 120

AAAGAACCAG GTAGTCGGTG CAGACATCGA CATGGCTCAG GCTATCGCTG ATGAACTTGG 180

GGTTAAGTTG GAAATCTCAA GCATGAGTTT TGACAATGTT TTGACCAGTC TTCAAACTGG 240

TAAGGCTGAC CTAGCAGTTG CAGGAATTAG TGCTACTGAC GAGAGAAAAG AAGTCTTTGA 300

TTTTTCAATC CCATACTATG AAAACAAGAT TAGTTTCTTG GTTCGTAAGG CTGATGTGGA 360

AAAATACAAG GATTTAACTA GCCTAGAAAG TGCTAATATT GCAGCCCAAA AAGGGACTGT 420

TCCAGAATCA ATGGTCAAGG AACAATTGCC AAAAGTTCAA TTAACTTCCC TAACTAATAT 480

GGGTGAAGCA GTCAATGAAT TGCAGGCTGG AAAAATAGAT GCTGTTCATA TGGATGAGCC 540

TGTTGCACTT AGTTATGCTG CTAAAAACGC TGGCTTAGCT GTCGCAACTG TCAGCTTGAA 600

GATGAAGGAC GGCGACGCCA ATGCC 625

(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

Cys Gln Ser Gly Ser Asn Gly Ser Gln Ser Ala Val Asp Ala Ile Lys
1 5 10 15

Gln Lys Gly Lys Leu Val Val Ala Thr Ser Pro Asp Tyr Ala Pro Phe
20 25 30

Glu Phe Gln Ser Leu Val Asp Gly Lys Asn Gln Val Val Gly Ala Asp
35 40 45

Ile Asp Met Ala Gln Ala Ile Ala Asp Glu Leu Gly Val Lys Leu Glu
50 55 60

Ile Ser Ser Met Ser Phe Asp Asn Val Leu Thr Ser Leu Gln Thr Gly
65 70 75 80

Lys Ala Asp Leu Ala Val Ala Gly Ile Ser Ala Thr Asp Glu Arg Lys
85 90 95

Glu Val Phe Asp Phe Ser Ile Pro Tyr Tyr Glu Asn Lys Ile Ser Phe
100 105 110

Leu Val Arg Lys Ala Asp Val Glu Lys Tyr Lys Asp Leu Thr Ser Leu
115 120 125

Glu Ser Ala Asn Ile Ala Ala Gln Lys Gly Thr Val Pro Glu Ser Met
130 135 140

Val Lys Glu Gln Leu Pro Lys Val Gln Leu Thr Ser Leu Thr Asn Met
145 150 155 160

Gly Glu Ala Val Asn Glu Leu Gln Ala Gly Lys Ile Asp Ala Val His
165 170 175

Met Asp Glu Pro Val Ala Leu Ser Tyr Ala Ala Lys Asn Ala Gly Leu
180 185 190

Ala Val Ala Thr Val Ser Leu Lys Met Lys Asp Gly Asp Ala Asn Ala
195 200 205

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:



GGAAACTTCA CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT 60

TAGTAAAAAT CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA 120

AAATTCCAAT AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA 180

TCCCAAAAAA GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA 240

AAAAGCAATC AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG 300

AATTTTTAAC GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT 360

AGAAGGTATT TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG 420

AAAGGAAATT GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG 480

GAAAAATTTT GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG 540

ACATAAGGCT ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA 600

CTTAAAAGGC ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA 660

TTATAATGGT GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC 720

ACATGGGATG CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA 780

CTTTAACGGC ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC 840

TGACGCAGGA TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAGATTCTAT 900

CAAACACAAC GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG 960

TGAGAAATAT TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC 1020

TACGGGTAAC TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA 1080

TCTGAAAATG ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC 1140

GGTCGCTTCT GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG 1200

TTTTAAATAC AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA 1260

TGGAACAAAA GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA 1320

TTTGATAGGT TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA 1380

TTTAAAAAAT GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA 1440

TACTGTAAAT TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GATATGAAGC 1500

GGATGAAGGT ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TAAAGCTATG 1560

GAACATGATT AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AAGATTTTAA 1620

AGATAAATTG GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA ACAAACCGAA 1680

TGTAGGTGAC GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AAGAACTCTA 1740

TAAAGAAGAT ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TAGATTTACT 1800

TTTAAAACCC GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA ATGTTATTAA 1860

TGGCAAATCA ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA TCGTGGCAGC 1920

TTCTACTGTT TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TATTGAAAAA 1980

TCTTAAGGGA GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TACAAAATAC 2040

TGCGCGACCT ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TTGCATCACC 2100

TAGACAACAG GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AAGTTGTAGC 2160

AACTTTCAAA AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA TTTCTCTTAA 2220

AGAAATAAAA GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CAAACAGACC 2280

TTTGACTTTT AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG ACAGATTAAA 2340

ACTTGATGAA ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TTCCAGAAAT 2400

TCACCCAGAA AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TCACTATAGG 2460

CGCAAATTCT AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AAAACAAAAA 2520

TAAATTTGTA GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG CTCTAAACTC 2580

CAGCGGGAAG AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG GATTTGCTGG 2640

GAATTGGAAC CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CAAGATCAAA 2700

AACACTGGGA GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TAAATAAGGG 2760

AATTGGTGGA GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AAAATAGAAA 2820

AGATAAAAAT ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA ATAACGAAGG 2880

GATCAACGCT CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TAGATTCAAA 2940

TGGAAATCCT CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TTGTATTAAG 3000

AAGTGCAGAA GAAGGATTGA TT

(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Glu Thr Ser Gin Asp Phe Lys Glu Lys Lys Thr Ala Val He Lys Glu 
± 5 10 15 

Lys Glu Val Val Ser Lys Asn Pro Val He Asp Asn Asn Thr Ser Asn 
20 25 30 

Glu Glu Ala Lys He Lys Glu Glu Asn Ser Asn Lys Ser Gin Gly Asp 
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35 



40 45 



Tyr Thr Asp Ser Phe Val Asn Lys Asn Thr Glu Asn Pro Lys Lys Glu 
50 55 60 

Asp Lys Val Val Tyr lie Ala Glu Phe Lys Asp Lys Glu Ser Gly Glu 
65 70 75 80 

Lys Ala He Lys Glu Leu Ser Ser Leu Lys Asn Thr Lys Val Leu Tyr 
85 90 95 

Thr Tyr Asp Arg He Phe Asn Gly Ser Ala He Glu Thr Thr Pro Asp 
100 105 HO 

Asn Leu Asp Lys He Lys Gin He Glu Gly He Ser Ser Val Glu Arg 
115 120 125 

Ala Gin Lys Val Gin Pro Met Met Asn His Ala Arg Lys Glu He Gly 
130 135 140 

Val Glu Glu Ala He Asp Tyr Leu Lys Ser He Asn Ala Pro Phe Gly 
145 150 155 160 

Lvs Asn Phe Asp Gly Arg Gly Met Val He Ser Asn He Asp Thr Gly 
165 170 175 

Thr Asp Tyr Arg His Lys Ala Met Arg He Asp Asp Asp Ala Lys Ala 
180 185 190 

Ser Met Arg Phe Lys Lys Glu Asp Leu Lys Gly Thr Asp Lys Asn Tyr 
195 200 205 

Trp Leu Ser Asp Lys He Pro His Ala Phe Asn Tyr Tyr Asn Gly Gly 
210 215 220 

Lvs He Thr Val Glu Lys Tyr Asp Asp Gly Arg Asp Tyr Phe Asp Pro 
225 230 235 240 

His Gly Met His He Ala Gly He Leu Ala Gly Asn Asp Thr Glu Gin 
245 250 255 

Asp He Lys Asn Phe Asn Gly He Asp Gly He Ala Pro Asn Ala Gin 
260 265 270 

He Phe Ser Tyr Lys Met Tyr Ser Asp Ala Gly Ser Gly Phe Ala Gly 
275 280 285 

Asp Glu Thr Met Phe His Ala He Glu Asp Ser He Lys His Asn Val 
290 295 300 

Asp Val Val Ser Val Ser Ser Gly Phe Thr Gly Thr Gly Leu Val Gly 
305 310 315 320 

Glu Lys Tyr Trp Gin Ala He Arg Ala Leu Arg Lys Ala Gly He Pro 
325 330 335 

Met Val Val Ala Thr Gly Asn Tyr Ala Thr Ser Ala Ser Ser Ser Ser 
340 345 350 

Trp Asp Leu Val Ala Asn Asn His Leu Lys Met Thr Asp Thr Gly Asn 
355 360 365 

Val Thr Arg Thr Ala Ala His Glu Asp Ala Ile Ala Val Ala Ser Ala
370 375 380

Lys Asn Gln Thr Val Glu Phe Asp Lys Val Asn Ile Gly Gly Glu Ser
385 390 395 400 

Phe Lys Tyr Arg Asn Ile Gly Ala Phe Phe Asp Lys Ser Lys Ile Thr
405 410 415 

Thr Asn Glu Asp Gly Thr Lys Ala Pro Ser Lys Leu Lys Phe Val Tyr 
420 425 430 

Ile Gly Lys Gly Gln Asp Gln Asp Leu Ile Gly Leu Asp Leu Arg Gly
435 440 445

Lys Ile Ala Val Met Asp Arg Ile Tyr Thr Lys Asp Leu Lys Asn Ala
450 455 460

Phe Lys Lys Ala Met Asp Lys Gly Ala Arg Ala Ile Met Val Val Asn
465 470 475 480 

Thr Val Asn Tyr Tyr Asn Arg Asp Asn Trp Thr Glu Leu Pro Ala Met 
485 490 495 

Gly Tyr Glu Ala Asp Glu Gly Thr Lys Ser Gln Val Phe Ser Ile Ser
500 505 510

Gly Asp Asp Gly Val Lys Leu Trp Asn Met Ile Asn Pro Asp Lys Lys
515 520 525 

Thr Glu Val Lys Arg Asn Asn Lys Glu Asp Phe Lys Asp Lys Leu Glu 
530 535 540 

Gln Tyr Tyr Pro Ile Asp Met Glu Ser Phe Asn Ser Asn Lys Pro Asn
545 550 555 560

Val Gly Asp Glu Lys Glu Ile Asp Phe Lys Phe Ala Pro Asp Thr Asp
565 570 575

Lys Glu Leu Tyr Lys Glu Asp Ile Ile Val Pro Ala Gly Ser Thr Ser
580 585 590

Trp Gly Pro Arg Ile Asp Leu Leu Leu Lys Pro Asp Val Ser Ala Pro
595 600 605

Gly Lys Asn Ile Lys Ser Thr Leu Asn Val Ile Asn Gly Lys Ser Thr
610 615 620

Tyr Gly Tyr Met Ser Gly Thr Ser Met Ala Thr Pro Ile Val Ala Ala
625 630 635 640

Ser Thr Val Leu Ile Arg Pro Lys Leu Lys Glu Met Leu Glu Arg Pro
645 650 655

Val Leu Lys Asn Leu Lys Gly Asp Asp Lys Ile Asp Leu Thr Ser Leu
660 665 670

Thr Lys Ile Ala Leu Gln Asn Thr Ala Arg Pro Met Met Asp Ala Thr
675 680 685

Ser Trp Lys Glu Lys Ser Gln Tyr Phe Ala Ser Pro Arg Gln Gln Gly
690 695 700

Ala Gly Leu Ile Asn Val Ala Asn Ala Leu Arg Asn Glu Val Val Ala
705 710 715 720 



Thr Phe Lys Asn Thr Asp Ser Lys Gly Leu Val Asn Ser Tyr Gly Ser 
725 730 735 

Ile Ser Leu Lys Glu Ile Lys Gly Asp Lys Lys Tyr Phe Thr Ile Lys
740 745 750 

Leu His Asn Thr Ser Asn Arg Pro Leu Thr Phe Lys Val Ser Ala Ser 
755 760 765 

Ala Ile Thr Thr Asp Ser Leu Thr Asp Arg Leu Lys Leu Asp Glu Thr
770 775 780

Tyr Lys Asp Glu Lys Ser Pro Asp Gly Lys Gln Ile Val Pro Glu Ile
785 790 795 800

His Pro Glu Lys Val Lys Gly Ala Asn Ile Thr Phe Glu His Asp Thr
805 810 815

Phe Thr Ile Gly Ala Asn Ser Ser Phe Asp Leu Asn Ala Val Ile Asn
820 825 830

Val Gly Glu Ala Lys Asn Lys Asn Lys Phe Val Glu Ser Phe Ile His
835 840 845 

Phe Glu Ser Val Glu Ala Met Glu Ala Leu Asn Ser Ser Gly Lys Lys 
850 855 860

Ile Asn Phe Gln Pro Ser Leu Ser Met Pro Leu Met Gly Phe Ala Gly
865 870 875 880

Asn Trp Asn His Glu Pro Ile Leu Asp Lys Trp Ala Trp Glu Glu Gly
885 890 895 

Ser Arg Ser Lys Thr Leu Gly Gly Tyr Asp Asp Asp Gly Lys Pro Lys 
900 905 910 

Ile Pro Gly Thr Leu Asn Lys Gly Ile Gly Gly Glu His Gly Ile Asp
915 920 925

Lys Phe Asn Pro Ala Gly Val Ile Gln Asn Arg Lys Asp Lys Asn Thr
930 935 940

Thr Ser Leu Asp Gln Asn Pro Glu Leu Phe Ala Phe Asn Asn Glu Gly
945 950 955 960

Ile Asn Ala Pro Ser Ser Ser Gly Ser Lys Ile Ala Asn Ile Tyr Pro
965 970 975

Leu Asp Ser Asn Gly Asn Pro Gln Asp Ala Gln Leu Glu Arg Gly Leu
980 985 990

Thr Pro Ser Pro Leu Val Leu Arg Ser Ala Glu Glu Gly Leu Ile
995 1000 1005 

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1696 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

TGTGGTCGAA GTTGAGACTC CTCAATCAAT AACAAATCAG GAGCAAGCTA GGACAGAAAA 60

CCAAGTAGTA GAGACAGAGG AAGCTCCAAA AGAAGAAGCA CCTAAAACAG AAGAAAGTCC 120

AAAGGAAGAA CCAAAATCGG AGGTAAAACC TACTGACGAC ACCCTTCCTA AAGTAGAAGA 180

GGGGAAAGAA GATTCAGCAG AACCAGCTCC AGTTGAAGAA GTAGGTGGAG AAGTTGAGTC 240

AAAACCAGAG GAAAAAGTAG CAGTTAAGCC AGAAAGTCAA CCATCAGACA AACCAGCTGA 300

GGAATCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AAAAGGCACC 360

AGTCGAGCCA GAAAAGCAAC CAGAAGCTCC TGAAGAAGAG AAGGCTGTAG AGGAAACACC 420

GAAACAAGAA GAGTCAACTC CAGATACCAA GGCTGAAGAA ACTGTAGAAC CAAAAGAGGA 480

GACTGTTAAT CAATCTATTG AACAACCAAA AGTTGAAACG CCTGCTGTAG AAAAACAAAC 540

AGAACCAACA GAGGAACCAA AAGTTGAACA AGCAGGTGAA CCAGTCGCGC CAAGAGAAGA 600

CGAACAGGCA CCAACGGCAC CAGTTGAGCC AGAAAAGCAA CCAGAAGTTC CTGAAGAAGA 660

GAAGGCTGTA GAGGAAACAC CGAAACCAGA AGATAAAATA AAGGGTATTG GTACTAAAGA 720

ACCAGTTGAT AAAAGTGAGT TAAATAATCA AATTGATAAA GCTAGTTCAG TTTCTCCTAC 780

TGATTATTCT ACAGCAAGTT ACAATGCTCT TGGACCTGTT TTAGAAACTG CAAAAGGTGT 840

CTATGCTTCA GAGCCTGTAA AACAGCCTGA GGTAAATAGC GAGACAAATA AACTTAAAAC 900

GGCTATTGAC GCTCTAAACG TTGATAAAAC TGAATTAAAC AATACGATTG CAGATGCAAA 960

AACAAAGGTA AAAGAACATT ACAGTGATAG AAGTTGGCAA AACCTCCAAA CTGAAGTTAC 1020

AAAGGCTGAA AAAGTTGCAG CTAATACAGA TGCTAAACAA AGTGAAGTTA ACGAAGCTGT 1080

TGAAAAATTA ACTGCAACTA TTGAAAAATT GGTTGAATTA TCTGAAAAGC CAATATTAAC 1140

ATTGACTAGT ACCGATAAGA AAATATTGGA ACGTGAAGCT GTTGCTAAGT ATACTCTAGA 1200

AAATCAAAAC AAAACAAAAA TCAAATCAAT CACAGCTGAA TTGAAAAAAG GAGAAGAAGT 1260

TATTAATACT GTAGTCCTTA CAGATGACAA GGTAACAACA GAAACTATAA GCGCTGCATT 1320

TAAGAACCTA GAGTACTACA AAGAATACAC CCTATCTACA ACTATGATTT ACGACAGAGG 1380

TAACGGTGAA GAAACTGAAA CTCTAGAAAA TCAAAATATT CAATTAGATC TTAAAAAAGT 1440

TGAGCTTAAA AATATTAAAC GTACAGATTT AATCAAATAC GAAAATGGAA AAGAAACTAA 1500

TGAATCACTG ATAACAACTA TTCCTGATGA TAAGAGCAAT TATTATTTAA AAATAACTTC 1560

AAATAATCAG AAAACTACAT TACTAGCTGT TAAAAATATA GAAGAAACTA CGGTTAACGG 1620

AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CTGCTGATAA 1680

TAAATTTGAA GAAGAA 1696

(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

Val Val Glu Val Glu Thr Pro Gln Ser Ile Thr Asn Gln Glu Gln Ala
1 5 10 15

Arg Thr Glu Asn Gln Val Val Glu Thr Glu Glu Ala Pro Lys Glu Glu
20 25 30

Ala Pro Lys Thr Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Glu Val
35 40 45

Lys Pro Thr Asp Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp
50 55 60 

Ser Ala Glu Pro Ala Pro Val Glu Glu Val Gly Gly Glu Val Glu Ser 
65 70 75 80 

Lys Pro Glu Glu Lys Val Ala Val Lys Pro Glu Ser Gln Pro Ser Asp
85 90 95

Lys Pro Ala Glu Glu Ser Lys Val Glu Gln Ala Gly Glu Pro Val Ala
100 105 110

Pro Arg Glu Asp Glu Lys Ala Pro Val Glu Pro Glu Lys Gln Pro Glu
115 120 125

Ala Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys Gln Glu Glu
130 135 140

Ser Thr Pro Asp Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu
145 150 155 160

Thr Val Asn Gln Ser Ile Glu Gln Pro Lys Val Glu Thr Pro Ala Val
165 170 175

Glu Lys Gln Thr Glu Pro Thr Glu Glu Pro Lys Val Glu Gln Ala Gly
180 185 190

Glu Pro Val Ala Pro Arg Glu Asp Glu Gln Ala Pro Thr Ala Pro Val
195 200 205

Glu Pro Glu Lys Gln Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu
210 215 220

Glu Thr Pro Lys Pro Glu Asp Lys Ile Lys Gly Ile Gly Thr Lys Glu
225 230 235 240

Pro Val Asp Lys Ser Glu Leu Asn Asn Gln Ile Asp Lys Ala Ser Ser
245 250 255 

Val Ser Pro Thr Asp Tyr Ser Thr Ala Ser Tyr Asn Ala Leu Gly Pro 
260 265 270 

Val Leu Glu Thr Ala Lys Gly Val Tyr Ala Ser Glu Pro Val Lys Gln
275 280 285

Pro Glu Val Asn Ser Glu Thr Asn Lys Leu Lys Thr Ala Ile Asp Ala
290 295 300

Leu Asn Val Asp Lys Thr Glu Leu Asn Asn Thr Ile Ala Asp Ala Lys
305 310 315 320

Thr Lys Val Lys Glu His Tyr Ser Asp Arg Ser Trp Gln Asn Leu Gln
325 330 335 

Thr Glu Val Thr Lys Ala Glu Lys Val Ala Ala Asn Thr Asp Ala Lys 
340 345 350 

Gln Ser Glu Val Asn Glu Ala Val Glu Lys Leu Thr Ala Thr Ile Glu
355 360 365

Lys Leu Val Glu Leu Ser Glu Lys Pro Ile Leu Thr Leu Thr Ser Thr
370 375 380

Asp Lys Lys Ile Leu Glu Arg Glu Ala Val Ala Lys Tyr Thr Leu Glu
385 390 395 400

Asn Gln Asn Lys Thr Lys Ile Lys Ser Ile Thr Ala Glu Leu Lys Lys
405 410 415

Gly Glu Glu Val Ile Asn Thr Val Val Leu Thr Asp Asp Lys Val Thr
420 425 430

Thr Glu Thr Ile Ser Ala Ala Phe Lys Asn Leu Glu Tyr Tyr Lys Glu
435 440 445

Tyr Thr Leu Ser Thr Thr Met Ile Tyr Asp Arg Gly Asn Gly Glu Glu
450 455 460

Thr Glu Thr Leu Glu Asn Gln Asn Ile Gln Leu Asp Leu Lys Lys Val
465 470 475 480

Glu Leu Lys Asn Ile Lys Arg Thr Asp Leu Ile Lys Tyr Glu Asn Gly
485 490 495

Lys Glu Thr Asn Glu Ser Leu Ile Thr Thr Ile Pro Asp Asp Lys Ser
500 505 510

Asn Tyr Tyr Leu Lys Ile Thr Ser Asn Asn Gln Lys Thr Thr Leu Leu
515 520 525

Ala Val Lys Asn Ile Glu Glu Thr Thr Val Asn Gly Thr Pro Val Tyr
530 535 540

Lys Val Thr Ala Ile Ala Asp Asn Leu Val Ser Arg Thr Ala Asp Asn
545 550 555 560 

Lys Phe Glu Glu Glu 
565 

(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CTGCTGATAA 60

TAAATTTGAA GAAGAATACG TTCACTATAT TGAAAAACCT AAAGTCCACG AAGATAATGT 120

ATATTATAAT TTCAAAGAAT TAGTGGAAGC TATTCAAAAC GATCCTTCAA AAGAATATCG 180

TCTGGGACAA TCAATGAGCG CTAGAAATGT TGTTCCTAAT GGAAAATCAT ATATCACTAA 240

AGAATTCACA GGAAAACTTT TAAGTTCTGA AGGAAAACAA TTTGCTATTA CTGAATTGGA 300

ACATCCATTA TTTAATGTGA TAACAAACGC AACGATAAAT AATGTGAATT TTGAAAATGT 360

AGAGATAGAA CGTTCTGGTC AAGATAATAT TGCATCATTA GCCAATACTA TGAAAGGTTC 420

TTCAGTTATT ACAAATGTCA AAATTACAGG CACACTTTCA GGTCGTAATA ATGTTGCTGG 480

ATTTGTAAAT AATATGAATG ATGGAACTCG TATTGAAAAT GTTGCTTTCT TTGGCAAACT 540

ACACTCTACA AGTGGAAATG GCTCTCATAC AGGGGGAATT GCAGGTACAA ACTATAGAGG 600

AATTGTTAGA AAAGCATATG TTGATGCTAC TATTACAGGA AACAAAACAC GCGCCAGCTT 660

GTTAGTTCCT AAAGTAGATT ATGGATTAAC TCTAGACCAT CTTATTGGTA CAAAAGCTCT 720

CCTAACTGAG TCGGTTGTAA AAGGTAAAAT AGATGTTTCA AATCCAGTAG AAGTTGGAGC 780

AATAGCAAGT AAGACTTGGC CTGTAGGTAC GGTAAGTAAT TCTGTCAGCT ATGCTAAGAT 840

TATCCGTGGA GAGGAGTTAT TCGGCTCTAA CGACGTTGAT GATTCTGATT ATGCTAGTGC 900

TCATATAAAA GATTTATATG CGGTAGAGGG ATATTCGTCA GGTAATAGAT CATTTAGGAA 960

ATCTAAAACA TTTACTAAAT TAACTAAAGA ACAAGCTGAT GCTAAAGTTA CTACTTTCAA 1020

TATTACTGCT GATAAATTAG AAAGTGATCT ATCTCCTCTT GCAAAACTTA ATGAAGAAAA 1080

AGCCTATTCT AGTATTCAAG ATTATAACGC TGAATATAAC CAAGCCTATA AAAATCTTGA 1140

AAAATTAATA CCATTCTACA ATAAAGATTA TATTGTATAT CAAGGTAATA AATTAAATAA 1200

AGAACACCAT CTAAATACTA AAGAAGTTCT TTCTGTTACC GCGATGAACA ACAATGAGTT 1260

TATCACAAAC CTAGATGAAG CTAATAAAAT TATTGTTCAC TATGCGGACG GTACAAAAGA 1320

TTACTTTAAC TTGTCTTCTA GCAGTGAAGG TTTAAGTAAT GTAAAAGAAT ATACTATAAC 1380

TGACTTAGGA ATTAAATATA CACCTAATAT CGTTCAAAAA GATAACACTA CTCTTGTTAA 1440

TGATATAAAA TCTATTTTAG AATCAGTAGA GCTTCAGTCT CAAACGATGT ATCAGCATCT 1500

AAATCGATTA GGTGACTATA GAGTTAATGC AATCAAAGAT TTATATTTAG AAGAAAGCTT 1560

CACAGATGTT AAAGAAAACT TAACAAACCT AATCACAAAA TTAGTTCAAA ACGAAGAACA 1620

TCAACTAAAT GATTCTCCAG CTGCTCGTCA AATGATTCGT GATAAAGTCG AGAAAAACAA 1680

AGCAGCTTTA TTACTAGGTT TAACTTACCT AAATCGTTAC TATGGAGTTA AATTTGGTGA 1740

TGTTAATATT AAAGAATTAA TGCTATTCAA ACCAGATTTC TATGGTGAAA AAGTTAGCGT 1800

ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CACGTACATT 1860

CGACGCATTC GGTCAAGTA 1879

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

Thr Pro Val Tyr Lys Val Thr Ala Ile Ala Asp Asn Leu Val Ser Arg
1 5 10 15

Thr Ala Asp Asn Lys Phe Glu Glu Glu Tyr Val His Tyr Ile Glu Lys
20 25 30 

Pro Lys Val His Glu Asp Asn Val Tyr Tyr Asn Phe Lys Glu Leu Val 
35 40 45 

Glu Ala Ile Gln Asn Asp Pro Ser Lys Glu Tyr Arg Leu Gly Gln Ser
50 55 60

Met Ser Ala Arg Asn Val Val Pro Asn Gly Lys Ser Tyr Ile Thr Lys
65 70 75 80

Glu Phe Thr Gly Lys Leu Leu Ser Ser Glu Gly Lys Gln Phe Ala Ile
85 90 95

Thr Glu Leu Glu His Pro Leu Phe Asn Val Ile Thr Asn Ala Thr Ile
100 105 110

Asn Asn Val Asn Phe Glu Asn Val Glu Ile Glu Arg Ser Gly Gln Asp
115 120 125

Asn Ile Ala Ser Leu Ala Asn Thr Met Lys Gly Ser Ser Val Ile Thr
130 135 140

Asn Val Lys Ile Thr Gly Thr Leu Ser Gly Arg Asn Asn Val Ala Gly
145 150 155 160

Phe Val Asn Asn Met Asn Asp Gly Thr Arg Ile Glu Asn Val Ala Phe
165 170 175 

Phe Gly Lys Leu His Ser Thr Ser Gly Asn Gly Ser His Thr Gly Gly 
180 185 190 

Ile Ala Gly Thr Asn Tyr Arg Gly Ile Val Arg Lys Ala Tyr Val Asp
195 200 205

Ala Thr Ile Thr Gly Asn Lys Thr Arg Ala Ser Leu Leu Val Pro Lys
210 215 220

Val Asp Tyr Gly Leu Thr Leu Asp His Leu Ile Gly Thr Lys Ala Leu
225 230 235 240

Leu Thr Glu Ser Val Val Lys Gly Lys Ile Asp Val Ser Asn Pro Val
245 250 255

Glu Val Gly Ala Ile Ala Ser Lys Thr Trp Pro Val Gly Thr Val Ser
260 265 270

Asn Ser Val Ser Tyr Ala Lys Ile Ile Arg Gly Glu Glu Leu Phe Gly
275 280 285

Ser Asn Asp Val Asp Asp Ser Asp Tyr Ala Ser Ala His Ile Lys Asp
290 295 300 

Leu Tyr Ala Val Glu Gly Tyr Ser Ser Gly Asn Arg Ser Phe Arg Lys 
305 310 315 320 

Ser Lys Thr Phe Thr Lys Leu Thr Lys Glu Gln Ala Asp Ala Lys Val
325 330 335

Thr Thr Phe Asn Ile Thr Ala Asp Lys Leu Glu Ser Asp Leu Ser Pro
340 345 350

Leu Ala Lys Leu Asn Glu Glu Lys Ala Tyr Ser Ser Ile Gln Asp Tyr
355 360 365

Asn Ala Glu Tyr Asn Gln Ala Tyr Lys Asn Leu Glu Lys Leu Ile Pro
370 375 380

Phe Tyr Asn Lys Asp Tyr Ile Val Tyr Gln Gly Asn Lys Leu Asn Lys
385 390 395 400 

Glu His His Leu Asn Thr Lys Glu Val Leu Ser Val Thr Ala Met Asn 
405 410 415 

Asn Asn Glu Phe Ile Thr Asn Leu Asp Glu Ala Asn Lys Ile Ile Val
420 425 430 

His Tyr Ala Asp Gly Thr Lys Asp Tyr Phe Asn Leu Ser Ser Ser Ser 
435 440 445 

Glu Gly Leu Ser Asn Val Lys Glu Tyr Thr Ile Thr Asp Leu Gly Ile
450 455 460

Lys Tyr Thr Pro Asn Ile Val Gln Lys Asp Asn Thr Thr Leu Val Asn
465 470 475 480

Asp Ile Lys Ser Ile Leu Glu Ser Val Glu Leu Gln Ser Gln Thr Met
485 490 495

Tyr Gln His Leu Asn Arg Leu Gly Asp Tyr Arg Val Asn Ala Ile Lys
500 505 510

Asp Leu Tyr Leu Glu Glu Ser Phe Thr Asp Val Lys Glu Asn Leu Thr
515 520 525

Asn Leu Ile Thr Lys Leu Val Gln Asn Glu Glu His Gln Leu Asn Asp
530 535 540

Ser Pro Ala Ala Arg Gln Met Ile Arg Asp Lys Val Glu Lys Asn Lys
545 550 555 560 

Ala Ala Leu Leu Leu Gly Leu Thr Tyr Leu Asn Arg Tyr Tyr Gly Val 
565 570 575 

Lys Phe Gly Asp Val Asn Ile Lys Glu Leu Met Leu Phe Lys Pro Asp
580 585 590

Phe Tyr Gly Glu Lys Val Ser Val Leu Asp Arg Leu Ile Glu Ile Gly
595 600 605

Ser Lys Glu Asn Asn Ile Lys Gly Ser Arg Thr Phe Asp Ala Phe Gly
610 615 620

Gln Val
625 

(2) INFORMATION FOR SEQ ID NO: 221:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1777 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CACGTACATT 60

CGACGCATTC GGTCAAGTAT TGGCTAAATA TACTAAATCA GGTAATTTAG ATGCATTTTT 120

AAATTATAAT AGACAATTGT TCACAAATAT AGACAATATG AACGATTGGT TTATTGATGC 180

TACAGAAGAC CATGTCTACA TCGCAGAACG CGCTTCTGAG GTCGAAGAAA TTAAAAATTC 240

TAAACATCGT GCATTCGATA ATTTAAAACG AAGTCACCTT AGAAATACTA TACTCCCACT 300

ACTGAATATT GATAAAGCAC ATCTTTATTT AATTTCAAAT TATAATGCAA TTGCCTTTGG 360

TAGTGCAGAG CGATTAGGTA AAAAATCATT AGAAGATATT AAAGATATCG TTAACAAAGC 420

TGCAGATGGT TATAGAAACT ATTATGATTT CTGGTATCGT CTAGCGTCTG ATAACGTTAA 480

ACAACGACTA CTAAGAGATG CTGTTATTCC TATTTGGGAA GGTTATAACG CTCCTGGTGG 540

ATGGGTTGAA AAATATGGCC GCTATAATAC CGACAAAGTA TATACTCCTC TTAGAGAATT 600

CTTTGGTCCT ATGGATAAGT ATTATAATTA TAATGGAACA GGAGCTTATG CTGCTATATA 660

TCCTAACTCT GATGATATTA GAACTGATGT AAAATATGTT CATTTAGAAA TGGTTGGTGA 720

ATACGGTATT TCAGTTTACA CACATGAAAC AACACACGTC AACGACCGTG CGATTTACTT 780

AGGTGGCTTT GGACACCGTG AAGGTACTGA TGCTGAAGCA TATGCTCAGG GTATGCTACA 840

AACTCCTGTT ACTGGTAGTG GATTTGATGA GTTTGGTTCT TTAGGTATTA ATATGGTATT 900

TAAACGCAAA AATGATGGGA ATCAGTGGTA TATTACAGAT CCAAAAACTC TAAAAACACG 960

AGAAGATATT AATAGATATA TGAAGGGTTA TAATGACACT TTAACTCTTC TTGATGAAAT 1020

TGAGGCTGAA TCTGTGATTT CTCAACAAAA TAAAGATTTA AATAGTGCAT GGTTCAAAAA 1080

AATAGATAGA GAATACCGTG ATAACAATAA ATTAAATCAA TGGGATAAAA TTCGAAATCT 1140

AAGTCAAGAA GAGAAAAATG AATTAAATAT TCAATCTGTT AATGATTTAG TTGATCAACA 1200

ATTAATGACT AATCGCAATC CAGGTAATGG TATCTATAAA CCCGAAGCAA TTAGCTATAA 1260

CGATCAATCA CCTTATGTAG GTGTTAGAAT GATGACCGGT ATCTACGGAG GTAATACTAG 1320

TAAAGGTGCT CCTGGAGCTG TTTCATTCAA ACATAATGCT TTTAGATTAT GGGGTTACTA 1380

CGGATACGAA AATGGGTTCT TAGGTTATGC TTCAAATAAA TATAAACAAC AATCTAAAAC 1440

AGATGGTGAG TCTGTTCTAA GTGATGAATA TATTATCAAG AAAATATCTA ACAATACATT 1500

TAATACTATT GAAGAATTTA AAAAAGCTTA CTTCAAAGAA GTTAAAGATA AAGCAACGAA 1560

AGGATTAACA ACATTCGAAG TAAATGGTTC TTCCGTTTCA TCATACGATG ATTTACTGAC 1620

ATTGTTTAAA GAAGCTGTTA AAAAAGATGC CGAAACTCTT AAACAAGAAG CAAACGGTAA 1680

TAAAACAGTA TCTATGAATA ATACAGTTAA ATTAAAAGAA GCTGTTTATA AGAAACTTCT 1740

TCAACAAACA AATAGCTTTA AAACTTCAAT CTTTAAA 1777
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

Leu Asp Arg Leu Ile Glu Ile Gly Ser Lys Glu Asn Asn Ile Lys Gly
1 5 10 15

Ser Arg Thr Phe Asp Ala Phe Gly Gln Val Leu Ala Lys Tyr Thr Lys
20 25 30

Ser Gly Asn Leu Asp Ala Phe Leu Asn Tyr Asn Arg Gln Leu Phe Thr
35 40 45

Asn Ile Asp Asn Met Asn Asp Trp Phe Ile Asp Ala Thr Glu Asp His
50 55 60

Val Tyr Ile Ala Glu Arg Ala Ser Glu Val Glu Glu Ile Lys Asn Ser
65 70 75 80 

Lys His Arg Ala Phe Asp Asn Leu Lys Arg Ser His Leu Arg Asn Thr 
85 90 95 

Ile Leu Pro Leu Leu Asn Ile Asp Lys Ala His Leu Tyr Leu Ile Ser
100 105 110

Asn Tyr Asn Ala Ile Ala Phe Gly Ser Ala Glu Arg Leu Gly Lys Lys
115 120 125

Ser Leu Glu Asp Ile Lys Asp Ile Val Asn Lys Ala Ala Asp Gly Tyr
130 135 140 

Arg Asn Tyr Tyr Asp Phe Trp Tyr Arg Leu Ala Ser Asp Asn Val Lys 
145 150 155 160 

Gln Arg Leu Leu Arg Asp Ala Val Ile Pro Ile Trp Glu Gly Tyr Asn
165 170 175 

Ala Pro Gly Gly Trp Val Glu Lys Tyr Gly Arg Tyr Asn Thr Asp Lys 
180 185 190 

Val Tyr Thr Pro Leu Arg Glu Phe Phe Gly Pro Met Asp Lys Tyr Tyr
195 200 205

Asn Tyr Asn Gly Thr Gly Ala Tyr Ala Ala Ile Tyr Pro Asn Ser Asp
210 215 220

Asp Ile Arg Thr Asp Val Lys Tyr Val His Leu Glu Met Val Gly Glu
225 230 235 240

Tyr Gly Ile Ser Val Tyr Thr His Glu Thr Thr His Val Asn Asp Arg
245 250 255

Ala Ile Tyr Leu Gly Gly Phe Gly His Arg Glu Gly Thr Asp Ala Glu
260 265 270

Ala Tyr Ala Gln Gly Met Leu Gln Thr Pro Val Thr Gly Ser Gly Phe
275 280 285

Asp Glu Phe Gly Ser Leu Gly Ile Asn Met Val Phe Lys Arg Lys Asn
290 295 300

Asp Gly Asn Gln Trp Tyr Ile Thr Asp Pro Lys Thr Leu Lys Thr Arg
305 310 315 320

Glu Asp Ile Asn Arg Tyr Met Lys Gly Tyr Asn Asp Thr Leu Thr Leu
325 330 335

Leu Asp Glu Ile Glu Ala Glu Ser Val Ile Ser Gln Gln Asn Lys Asp
340 345 350

Leu Asn Ser Ala Trp Phe Lys Lys Ile Asp Arg Glu Tyr Arg Asp Asn
355 360 365

Asn Lys Leu Asn Gln Trp Asp Lys Ile Arg Asn Leu Ser Gln Glu Glu
370 375 380

Lys Asn Glu Leu Asn Ile Gln Ser Val Asn Asp Leu Val Asp Gln Gln
385 390 395 400

Leu Met Thr Asn Arg Asn Pro Gly Asn Gly Ile Tyr Lys Pro Glu Ala
405 410 415

Ile Ser Tyr Asn Asp Gln Ser Pro Tyr Val Gly Val Arg Met Met Thr
420 425 430

Gly Ile Tyr Gly Gly Asn Thr Ser Lys Gly Ala Pro Gly Ala Val Ser
435 440 445 

Phe Lys His Asn Ala Phe Arg Leu Trp Gly Tyr Tyr Gly Tyr Glu Asn 
450 455 460 

Gly Phe Leu Gly Tyr Ala Ser Asn Lys Tyr Lys Gln Gln Ser Lys Thr
465 470 475 480

Asp Gly Glu Ser Val Leu Ser Asp Glu Tyr Ile Ile Lys Lys Ile Ser
485 490 495

Asn Asn Thr Phe Asn Thr Ile Glu Glu Phe Lys Lys Ala Tyr Phe Lys
500 505 510 

Glu Val Lys Asp Lys Ala Thr Lys Gly Leu Thr Thr Phe Glu Val Asn 
515 520 525 

Gly Ser Ser Val Ser Ser Tyr Asp Asp Leu Leu Thr Leu Phe Lys Glu 
530 535 540



Ala Val Lys Lys Asp Ala Glu Thr Leu Lys Gln Glu Ala Asn Gly Asn
545 550 555 560

Lys Thr Val Ser Met Asn Asn Thr Val Lys Leu Lys Glu Ala Val Tyr
565 570 575

Lys Lys Leu Leu Gln Gln Thr Asn Ser Phe Lys Thr Ser Ile Phe Lys
580 585 590 



(2) INFORMATION FOR SEQ ID NO: 223:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:

TAAGACAGAT GAACGGAGCA AGGTGTTTGA CTTTTCCATT CCCTACTATA CTGCAAAAAA 60

TAAACTCATT GTCAAAAAAT CTGACTTGAC TACTTATCAG TCTGTAAACG ACTTGGCGCA 120

GAAAAAGGTT GGAGCGCAGA AAGGTTCGAT TCAAGAGACG ATGGCGAAAG ATTTGCTACA 180

AAATTCTTCC CTCGTATCTC TGCCTAAAAA TGGGAATTTA ATCACAGATT TAAAATCAGG 240

ACAAGTGGAT GCCGTTATCT TTGAAGAACC TGTTTCCAAG GGATTTGTGG AAAATAATCC 300

TGATTTAGCA ATCGCAGACC TCAATTTTGA AAAAGAGCAA GATGATTCCT ACGCGGTAGC 360

CATgAAAAAA GATAGCAAGA AATTGAAGAG GCAGTTCGAT AAAACCATTC AAAAGTTGAA 420

GGAGTCTGGG GAATTAGACA AACTCATTGA GGAAGCCTTA 460
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Lys Thr Asp Glu Arg Ser Lys Val Phe Asp Phe Ser Ile Pro Tyr Tyr
1 5 10 15

Thr Ala Lys Asn Lys Leu Ile Val Lys Lys Ser Asp Leu Thr Thr Tyr
20 25 30

Gln Ser Val Asn Asp Leu Ala Gln Lys Lys Val Gly Ala Gln Lys Gly
35 40 45

Ser Ile Gln Glu Thr Met Ala Lys Asp Leu Leu Gln Asn Ser Ser Leu
50 55 60

Val Ser Leu Pro Lys Asn Gly Asn Leu Ile Thr Asp Leu Lys Ser Gly
65 70 75 80

Gln Val Asp Ala Val Ile Phe Glu Glu Pro Val Ser Lys Gly Phe Val
85 90 95

Glu Asn Asn Pro Asp Leu Ala Ile Ala Asp Leu Asn Phe Glu Lys Glu
100 105 110

Gln Asp Asp Ser Tyr Ala Val Ala Met Lys Lys Asp Ser Lys Lys Leu
115 120 125

Lys Arg Gln Phe Asp Lys Thr Ile Gln Lys Leu Lys Glu Ser Gly Glu
130 135 140

Leu Asp Lys Leu Ile Glu Glu Ala Leu
145 150 

(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 751 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



CTGTGAGAAT CAAGCTACAC CCAAAGAGAC TAGCGCTCAA AAGACAATCG TCCTTGCTAC 60

AGCTGGCGAC GTGCCACCAT TTGACTACGA AGACAAGGGC AATCTGACAG GCTTTGATAT 120

CGAAGTTTTA AAGGCAGTAG ATGAAAAACT CAGCGACTAC GAGATTCAAT TCCAAAGAAC 180

CGCCTGGGAG AGCATCTTCC CAGGACTTGA TTCTGGTCAC TATCAGGCTG CGGCCAATAA 240

CTTGAGTTAC ACAAAAGAGC GTGCTGAAAA ATACCTTTAC TCGCTTCCAA TTTCCAACAA 300

TCCCCTCGTC CTTGTCAGCA ACAAGAAAAA TCCTTTGACT TCTCTTGACC AGATCGCTGG 360

TAAAACAACA CAAGAGGATA CCGGAACTTC TAACGCTCAA TTCATCAATA ACTGGAATCA 420

GAAACACACT GATAATCCCG CTACAATTAA TTTTTCTGGT GAGGATATTG GTAAACGAAT 480

CCTAGACCTT GCTAACGGAG AGTTTGATTT CCTAGTTTTT GACAAGGTAT CCGTTCAAAA 540

GATTATCAAG GACCGTGGTT TAGACCTCTC AGTCGTTGAT TTACCTTCTG CAGATAGCCC 600

CAGCAATTAT ATCATTTTCT CAAGCGACCA AAAAGAGTTT AAAGAGCAAT TTGATAAAGC 660

GCTCAAAGAA CTCTATCAAG ACGGAACCCT TGAAAAACTC AGCAATACCT ATCTAGGTGG 720

TTCTTACCTC CCAGATCAAT CTCAGTTACA A 751



(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

Cys Glu Asn Gln Ala Thr Pro Lys Glu Thr Ser Ala Gln Lys Thr Ile
1 5 10 15

Val Leu Ala Thr Ala Gly Asp Val Pro Pro Phe Asp Tyr Glu Asp Lys
20 25 30

Gly Asn Leu Thr Gly Phe Asp Ile Glu Val Leu Lys Ala Val Asp Glu
35 40 45

Lys Leu Ser Asp Tyr Glu Ile Gln Phe Gln Arg Thr Ala Trp Glu Ser
50 55 60

Ile Phe Pro Gly Leu Asp Ser Gly His Tyr Gln Ala Ala Ala Asn Asn
65 70 75 80

Leu Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro
85 90 95

Ile Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu
100 105 110

Thr Ser Leu Asp Gln Ile Ala Gly Lys Thr Thr Gln Glu Asp Thr Gly
115 120 125

Thr Ser Asn Ala Gln Phe Ile Asn Asn Trp Asn Gln Lys His Thr Asp
130 135 140

Asn Pro Ala Thr Ile Asn Phe Ser Gly Glu Asp Ile Gly Lys Arg Ile
145 150 155 160

Leu Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val
165 170 175

Ser Val Gln Lys Ile Ile Lys Asp Arg Gly Leu Asp Leu Ser Val Val
180 185 190

Asp Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr Ile Ile Phe Ser Ser
195 200 205

Asp Gln Lys Glu Phe Lys Glu Gln Phe Asp Lys Ala Leu Lys Glu Leu
210 215 220

Tyr Gln Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly
225 230 235 240

Ser Tyr Leu Pro Asp Gln Ser Gln Leu Gln
245 250 

(2) INFORMATION FOR SEQ ID NO: 227:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
GACTGGATCC TAAAATCTAC GACAATAAAA ATC
(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
CTGAGTCGAC TGGTTGTGCT GGTTGAG 
(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:
GTCAGGATCC AAATTACAAT ACGGACTATG 
(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:
CAGTGTCGAC TAACTCTAGG TCGGAAAC 
(2) INFORMATION FOR SEQ ID NO: 231:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:
GACTGGATCC TGAGAATCAA GCTACACCCA AAGAG 
(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:
AGTCAAGCTT TTGTAACTGA GATTGATCTG G 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
GACTGGATCC TGGTAACCGC TCTTCTCGTA ACGCAGC
(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
AGTCAAGCTT TTTCAGGAAC TTTTACGCTT CC 
(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
AGTCAGATCT TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGC 
(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:
ACTGAAGCTT TTTTGTTTTT CAAGAATTCA TCG
(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 
GACTGGATCC TGGTCAAGGA ACTGCTTCTA AAGAC 
(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
AGTCAAGCTT TCACAAATTC GTTGGTGAAG CC 
(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:
GACTGGATCC TAGCTCAGGT GGAAACGCTG GTTCATCC 
(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:
AGTCAAGCTT ATCAACTTTT CCACCTTCAA CAACC
(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:
GTCAAGATCT CTCCAACTAT GGTAAATCTG CGGATGG 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:
AGTCCTGCAG ATCCACATCC GCTTTCATCG GGTTAAAGAA GG 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:
GACTGGATCC TGGGAAAAAT TCTAGCGAAA CTAGTGG 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:
GTCACTGCAG CTGTCCTTCT TTTACTTCTT TGGTTGC
(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:
GACTGGATCC TGCTAGCGGA AAAAAAGATA CAACTTCTGG
(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:
CTGAAAGCTT TTTTGCCAAT CCTTCAGCAA TCTTGTC
(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:
GACTAGATCT TGGCTCAAAA AATACAGCTT CAAGTCC 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:
AGTCCTGCAG GTTTTTGTTT GCTTGGTATT GGTCG
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:
GACTGGATCC TAGTACAAAC TCAAGCACTA GTCAGACAGA G
(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:
CAGTCTGCAG TTTCAAAGCT TTTTGTATGT CTTC
(2) INFORMATION FOR SEQ ID NO: 251:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:
GACTGGATCC TGGCAATTCT GGCGGAAGTA AAGATGC 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:
AGTCAAGCTT GTTTCATAGC TTTTTTGATT GTTTCG 
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:
GACTGGATCC TTCACAAGAA AAAACAAAAA ATGAAGATGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

AGTCAAGCTT ATCGACGTAG TCTCCGCCTT C 

(2) INFORMATION FOR SEQ ID NO: 255:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
GACTGGATCC GAAAGGTCTG TGGTCAAATA ATCTTACC 
(2) INFORMATION FOR SEQ ID NO: 256:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:
AGTCAAGCTT AGAGTTAACA TGGTGCTTGC CAATAGG 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
GACTGGATCC AAACTCAGAA AAGAAAGCAG ACAATGC 
(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:
AGTCAAGCTT CCAAACTGGT TGATCCAAAC CATCTG
(2) INFORMATION FOR SEQ ID NO: 259:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:
GACTGGATCC TTCGAAAGGG TCAGAAGGTG CAGACC 36

(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:
AGTCAAGCTT CTGTAGGCTT GGTGTGCCCC AGTTGC 36

(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:
CTGAGGATCC GGGGATGGCA GCTTTTAAAA ATC 33
(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
CAGTAAGCTT GTTTACCCAT TCACCATTAC C 31
(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
CAGTGGATCC AGACGAGCAA AAAATTAAG
(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
TCAGAAGCTT GTTTACCCAT TCACCATT
(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
GACTGGATCC CTGTGGTGAG GAAGAAACTA AAAAG 
(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
CTGAGTCGAC AATATTCTGT AGGAATGCTT CGAATTTG
(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:
CTGAGGATCC GACTTTTAAC AATAAAACTA TTGAAGAG
(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
GTCACTGCAG GTTGTCACCT CCAAAAATCA CGG
(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
GACTGGATCC CTTTACAGGT AAACAACTAC AAGTCGG 
(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:
CAGTAAGCTT TTCGAAGTTT GGCTCAGAAT TG
(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 
GACTGGATCC CCAGGCTGAT ACAAGTATCG CA 
(2) INFORMATION FOR SEQ ID NO: 272:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
CAGTAAGCTT ATCTGCAGTA TGGCTAGATG G 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:
GACTGGATCC GTCTGTATCA TTTGAAAACA AAGAAAC
(2) INFORMATION FOR SEQ ID NO: 274:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
CAGTCTGCAG TTTTACTGTT GCTGTGCTTG TG 
(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:
ACTGAGATCT TGGTCAAAAG GAAAGTCAGA CAGGAAAGG
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:
CAGTAAGCTT ATTCCTGAGC TTTTTTGATA AAGGTTGCGC A
(2) INFORMATION FOR SEQ ID NO: 277:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:
ACTGGGATCC GAAGGATAGA TATATTTTAG CATTTGAGAC
(2) INFORMATION FOR SEQ ID NO: 278:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:
AGTCAAGCTT CCATGGTATC AAAGGCAAGA CTTGG
(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:
GTCAGGATCC GGTAGTTAAA GTTGGTATTA ACGG
(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:
AGTCAAGCTT GCAATTTTTG CGAAGTATTC CAAGAG 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:
AGTCGGATCC TTCTTACGAG TTGGGACTGT ATCAAGC
(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:
AGTCAAGCTT GTTTATTTTT TCCTTACTTA CAGATGAAGG 
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 
AGTCGGATCC TACTGAGATG CATCATAATC TAGGAGC 
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

TCAGCTCGAG TTCTTTGACA TCTCCATCAT AAGTCGC 

(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:
GACTGGATCC GGTTTTGAGA AAGTATTTGC AGGGG 
(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

CAGTAAGCTT GGATTTTTTC ATGGATGCAA TTTTTTTGG 

(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
GACTGGATCC GACAACATTT ACTATCCATA CAGTAGAGTC AGC 
(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:
GACTAAGCTT GGCATAAGGT TGCAATTCTG GATTAATTGG
(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

GACTGGATCC GGCTAAGGAA AGAGTGGATG 

(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:
GACTAAGCTT TTCATTTTTA AATTGACTAT GCGCCCG
(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:
GACTGGATCC TTGTTCCTAT GAACTTGGTC GTCACC
(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:
CATGAAGCTT ATCCTGGATT TTTCCAAGTA AATCT 
(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:
GACTGGATCC TTATAAGGGT GAATTAGAAA AAGG
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:
GACTAAGCTT CTTATTAGGA TTGTTAGTAG TTG
(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:
GACTGGATCC GAATGTTCAG GCTCAAGAAA GTTCAGG
(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:
GACTAAGCTT TTCCCCTGAT GGAGCAAAGT AATACC
(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:
GACTGGATCC CTTGGGTGTA ACCCATATCC AGCTCCTTCC
(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
GACTGTCGAC TTCAGCTTGT TTATCTGGGG TTGC
(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:
GACTGGATCC TAGTGATGGT ACTTGGCAAG GAAAACAG
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:
ACTGCTGCAG ATCTTTGCCA CCTAGCTTCT CATTG
(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:
GTCAGGATCC TGGGATTCAA TATGTCAGAG ATGATACTAG
(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
CTAGAAGCTT ACGCACCCAT TCACCATTAT CATTG
(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:
GTCAGGATCC GGATAATAGA GAAGCATTAA AAACC
(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:
AGTCAAGCTT GACAAAATCT TGAAACTCCT CTGGTC
(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:
GTCAGGATCC AGATTTTGTC GAGGAGTGTC ATACC
(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 
AGTCAAGCTT TCCCTTTTTA CCCTTACGAA TCCAGG 
(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GACTGGATCC ATCTGTAGTT TATGCGGATG AAACACTTAT TAC 
(2) INFORMATION FOR SEQ ID NO: 308:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:
GACTGTCGAC GCTTTGGTAG AGATAGAAGT CATG
(2) INFORMATION FOR SEQ ID NO: 309:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:
GACTGGATCC TTACTTTGGT ATCGTAGATA CAGCCGGC
(2) INFORMATION FOR SEQ ID NO: 310:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:
AGTCAAGCTT TGTTAATTGC GTACCTTCTA AGCGACC 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:
GACTGGATCC AGCTAAGGTT GCATGGGATG CGATTCG
(2) INFORMATION FOR SEQ ID NO: 312:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:
GACTGTCGAC CTGGGCTTTA TTAGTTTGAC TAGC 
(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:
CAGTGGATCC CTATCACTAT GTAAATAAAG AGA 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314:
ACTGAAGCTT TTCTGTCCCT GTTTGAGGCA
(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
CAGTGGATCC TGAGACTCCT CAATCAATAA CAAA 34 
(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:
ACGTAAGCTT ATAATCAGTA GGAGAAACTG AACT 34
(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:
CAGTGGATCC GGATGCTCAA GAAACTGCGG 30 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

GACTAAGCTT TTGCCTCTCA TTCTTGCTTC C 31 

(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
CAGTGGATCC CGACAAAGGT GAGACTGAG 
(2) INFORMATION FOR SEQ ID NO: 320:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:
ACGTAAGCTT ATTTCTTAAT TCAAGTGTTT TCTCTG
(2) INFORMATION FOR SEQ ID NO: 321:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:
GACTGGATCC AAATCAATTG GTAGCACAAG ATCC
(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
CAGTGTCGAC ATTAGGAGCC ACTGGTCTC
(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:
CAGTGGATCC CAAACAGTCA GCTTCAGGAA C
(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:
GACTCTGCAG TTTAATCTTG TCCCAGGTGG
(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:
GACTGGATCC ATTCGATGAT GCGGATGAAA AG 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:
GACTAAGCTT CATTTGTCTT TGGGTATTTC GCA
(2) INFORMATION FOR SEQ ID NO: 327:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:
CAGTGGATCC GGAGAGTCGA TCAAAAGTAG
(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:
GTCACTGCAG TTGCTCGTCT CGAGGTTC
(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 
CAGTGGATCC ATGGACAACA GGAAACTGGG AC
(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:
CAGTAAGCTT ATTAGCTTCT GTACCTGTGT TTG
(2) INFORMATION FOR SEQ ID NO: 331:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331:
GACTGGATCC CGATGGGCTC AATCCAACCC CAGGTCAAGT C
(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332:
GACTCTGCAG CATAGCTTTA TCCTCTGACA TCATCGTATC
(2) INFORMATION FOR SEQ ID NO: 333:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:
GACTGGATCC TTCCAATCAA AAACAGGCAG ATGG
(2) INFORMATION FOR SEQ ID NO: 334:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:
GACTAAGCTT GAGTCCCATA GTCCAAGGCA
(2) INFORMATION FOR SEQ ID NO: 335:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:
AGTCGGATCC TATCACAGGA TCGAACGGTA AGACAACC
(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:
ACTGGTCGAC TTCTTTTAAC TCCGCTACTG TGTC
(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:
CAGTGGATCC AAGTTCATCG AAGATGGTTG GGAAGTCC
(2) INFORMATION FOR SEQ ID NO: 338:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:
GATCGTCGAC CCGCTCCCAC ATGCTCAACC TT
(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:
TGACGGATCC ATCGCTAGCT AGTGAAATGC AAGAAAG
(2) INFORMATION FOR SEQ ID NO: 340:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:
TGACAAGCTT ATTCGTTTTT GAACTAGTTG CTTTCGT
(2) INFORMATION FOR SEQ ID NO: 341:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:
GACTGGATCC GCACCAGATG GGGCACAAGG TTCAGGG
(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:
TGACAAGCTT AACTTGTAAC GAACAGTTCA ATCTG 
(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:
GACTAGATCT TTTTAACCCA ACTGTTGGTA CTTTCC
(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:
TGACAAGCTT GTTAGGTGTT ACATTTTGAC CGTC
(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:
ACTGAGATCT TTTTAACCCA ACTGTTGGTA CTTTC
(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:
GACTAAGCTT TCTACGATAA CGATCATTTT CTTTACC 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347:
GACTGTCGAC TCGTAGATAT TTAAGTCTAA GTGAAGCG
(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:
AGTCAAGCTT GTTAGGTGTT ACATTTTGCA AGTC 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:
GACTGGATCC CTTTGGTTTT GAAGGAAGTA AG 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

TGACCTGCAG ACGATTTTTG AAAAATGGAG GTGTATC 

(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
CAGTGGATCC CTACTACCTC TCGAGAGAAA G
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:
ACTGAAGCTT TTCGCTTTTT ACTCGTTTGA CA
(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 
CAGTGGATCC TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GC 
(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 
CAGTAAGCTT TAGGGTATCC AAATACTGGT TGTTGATG
(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:
TGACAGATCT TGACGGGTCT CAGGATCAGA CTCAGG
(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:
TGACAAGCTT CAAAGACATC CACCTCTTGA CCTTTG
(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:
GACTGGATCC TAGAGGCTTT GCCAAATGGT GGGAAGGG
(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:
GTCAGTCGAC TTGTTGTAAC ACTTTTCGAG GTTTGGTACC
(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:
CAGTGGATCC TCAAAAAGAG AAGGAAAACT TGG
(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:
CAGTCTGCAG TTTCTTCAAC AAACCTTGTT CTTG
(2) INFORMATION FOR SEQ ID NO: 361:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:
CAGTGGATCC ACGTTCTATT GAGGACCACT T
(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:
CAGTAAGCTT TTCCTTCTCA GTCAATTCTT TTCC
(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:
GACTGGATCC CGCTCAAAAT ACCAGAGGTG TTCAG
(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTAAGCTT AGTACCATGG GTGTGACAGG TTTGAA 
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(2) INFORMATION FOR SEQ ID NO : 3 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:
CTGAGGATCC AATTGTACAA TTAGAAAAAG ATAGC
(2) INFORMATION FOR SEQ ID NO: 366:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:
TGACAAGCTT GCGTTGACTA GGTTCTGCAA TGCC
(2) INFORMATION FOR SEQ ID NO: 367:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:
GACTGGATCC TCTGACCAAG CAAAAAGAAG CAGTCAATGA
(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:
TCAGCAGCTG ATCATTGACT TTACGATTTG CTCC
(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:
GACTGGATCC GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC G
(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:
TCAGAAGCTT ATTTTTTGTT TCCTTAATGC GTT
(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:
GACTGGATCC GGGACAAATT CAAAAAAATA GGCAAGAGG
(2) INFORMATION FOR SEQ ID NO: 372:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:
GTCAAAGCTT TGGCTCTTTG ATTGCCAACA ACTG 
(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:
GACTGGATCC TCGCTACCAG CAACAAAGCG AGCAAAAGG
(2) INFORMATION FOR SEQ ID NO: 374:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:
GACTAAGCTT ACTTTTTTCT TTTTCCACAC GA
(2) INFORMATION FOR SEQ ID NO: 375:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:
CAGTGGATCC GAACCGACAA GTCGCCCACT ATCAAGACT 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:
CTGAAAGCTT TGAATTCTCT TTCTTTTCAG GCT
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:
TCGAGGATCC GGTTGTCGGC TGGCAATATA TCCCGT
(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
CAGTAAGCTT CCGAACCCAT TCGCCATTAT AGTTGAC 
(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
AGTCGGATCC GGCCAAATCA GAATGGGTAG AAGAC
(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
TGACCTGCAG CTTCTCATTG ATTTTCATCA TCAC 
(2) INFORMATION FOR SEQ ID NO: 381:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
GACTGGATCC ATTTGCAGAT GATTCTGAAG GATGG 
(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

TCAGCTGCAG CTTAACCCAT TCACCATTCT AGTTTAAG 

(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:
GACTGGATCC TGTCGCTGCA AATGAAACTG AAGTAGC
(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:
GACTAAGCTT ATACCAAACG CTGACATCTA CGCG
(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:
AGTCAGATCT TACGTCTCAG CCTACTTTTG TAAGAGC
(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:
GACTAAGCTT AACCCATTCA CCATTGGCAT TGAC 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:
CAGTGGATCC TGGACAGGTG AAAGGTCATG CTACATTTGT G
(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388:
GACTAAGCTT CAACCATTGA GACCTTGCAA CAC
(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 
GTCAGGATCC GATTGCTCCT TTGAAGGATT TGAGAGAAAC C 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:
GACTAAGCTT CGATCAAAGA TAAGATAAAT ATATATAAAG T
(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 
GACTGGATCC TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGG 
(2) INFORMATION FOR SEQ ID NO: 392:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 
TGACAAGCTT ATCTATCAGC TCATTTAATC GTTTTTG 
(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CTGAGGATCC CAACGTTGAG AATTATTTGC GAATG 
(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:
TGACAAGCTT GAGTCTACAA AAGTAATGTA C
(2) INFORMATION FOR SEQ ID NO: 395:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 
GTCAGGATCC CTACTATCAA TCAAGTTCTT CAGCC 
(2) INFORMATION FOR SEQ ID NO: 396:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:
TGACAAGCTT GACTGAGGCT TGGACCAGAT TGAAAAG 37

(2) INFORMATION FOR SEQ ID NO: 397:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:
GACTGGATCC GACAAAAACA TTAAAACGTC CTGAGG
(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:
GACTAAGCTT AGCACGAACT GTGACGCTGG TTCC 
(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399:
GACTGGATCC TTCTCAGGAG ACCTTTAAAA ATATC 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:
GACTAAGCTT GTTGGCCATC TTGTACATAC C 
(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:
GACTGGATCC AGTAAATGCG CAATCAAATT C 
(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:
AGTCCTGCAG GTATTTAGCC CAATAATCTA TAAAGCT 
(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 
CAGTGGATCC TTACCGCGTT CATCAAGATG TC 
(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
GACTAAGCTT GCCAGATGTT GAAAAGAGAG TG 
(2) INFORMATION FOR SEQ ID NO: 405:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 
GACTGGATCC GTGGATGGGC TTTAACTATC TTCGTATTCG 
(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:
AGTCAAGCTT GCTAGTCTTC ACTTTCCCTT TCC 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:
GACTGTCGAC ACTAAACCAG CATCGTTCGC AGGA
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:
CTGACTGCAG CTTCTTGAAG AAATAATGAT TGTGG
(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:
CAGTGGATCC TGACTACCTT GAAATCCCAC TT
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:
CAGTAAGCTT TTTTTTAAGG TTGTAGAATG ATTTCAATC 39

(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 
CAGTGTCGAC TCGTATCTTT TTTTGGAGCA ATGTT 35

(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 
GACTAAGCTT AAATGTTCCG ATACGGGTGA TTG 33 
(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:
CAGTGGATCC GGACTCTCTC AAAGATGTGA AAG 33
(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

GACTAAGCTT CTTGAGTTTG TCAAGGATTG CTTT 34 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:
CAGTGGATCC CAAGAAATCC TATCATCTCT TCCAGAAG 
(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:
GACTAAGCTT TTCAGAACTA AAAGCCGCAG CTT 
(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:
GACTGGATCC ACGAAATGCA GGGCAGACAG
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:
CAGTAAGCTT ATCAACATAA TCTAGTAAAT AAGCGT 
(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:
CAGTGGATCC TGTATAGTTT TTAGCGCTTG TTCTTC
(2) INFORMATION FOR SEQ ID NO: 420:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:
GTCAAAGCTT TGATAGAGTG TCATAATCTT CTTTAG 
(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:
GACTGGATCC GTGTGTCGAG CATATTCTGA AG
(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:
CAGTAAGCTT ACTTTTACCA TTTCTTTGTT CTGCATC 
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:
GACTGTCGAC GTGTTTGGAT AGCATTCAGA ATCAGACG 
(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 
CAGTAAGCTT CGGAAGTAAA GACAATTTTT CC
(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:
CAGTGGATCC GTGCCTAGAT AGTATTATTA CTCAAAC
(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 
GACTAAGCTT TTTGCTTATT TCTCTCAATT TTTC 
(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
CAGTGGATCC CATTCAGAAG CAGACCTATC AAAATC 
(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
ACTGAAGCTT ATGTAATTTT TTAGATTTTT CAATATTTTT CAG
(2) INFORMATION FOR SEQ ID NO: 429:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:
AGTCGGATCC TAAGGCTGAT AATCGTGTTC AAATG
(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:
GACTAAGCTT AAAATTAGAT AGACGTTGAG T
(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:
AGTCGGATCC CTGTGGCAAT CAGTCAGCTG CTTCC 
(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:
GACTGTCGAC TTTAATCTTG TCCCAGGTGG TTAATTTGCC 
(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:
ACTGGTCGAC TTGTCAACAA CAACATGCTA CTTCTGAG 
(2) INFORMATION FOR SEQ ID NO: 434:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:
GACTCTGCAG AAGTTTAACC CACTTATCAT TATCC 
(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 
ACTGGGATCC TTGTTCAGGC AAGTCCGTGA CTAGTGAAC
(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:
GACTAAGCTT GGCTAATTCC TTCAAAGTTT GCA 
(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:
AGTCGGATCC CTCGCAAATT GAAAAGGCGG CAGTTAGCC
(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:
GACTAAGCTT GTAAATAAGC GTACCTTTTT CTTCC 35

(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:
TCAGGGATCC TTGTCAGTCA GGTTCTAATG GTTCTCAG 38

(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
AGTCAAGCTT GGCATTGGCG TCGCCGTCCT TC 
(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 
GACTGGATCC GGAAACTTCA CAGGATTTTA AAGAGAAG 38

(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:
GACTGTCGAC AATCAATCCT TCTTCTGCAC TTCT 
(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:
CAGTGGATCC TGTGGTCGAA GTTGAGACTC CTCAATC 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:
GACTAAGCTT TTCTTCAAAT TTATTATCAG C 
(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:
AGTCGGATCC AACACCTGTA TATAAAGTTA CAGCAATCG 
(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

GACTGTCGAC TACTTGACCG AATGCGTCGA ATGTACG 

(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 
CTGAGGATCC ATTAGACAGA TTAATTGAAA TCGG 34 
(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:
GACTGTCGAC TTTAAAGATT GAAGTTTTAA AGCT 34
(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 
TGACGGATCC TAAGACAGAT GAACGGAGCA AGGTG 35
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 
CTGAAAGCTT TAAGGCTTCC TCAATGAGTT TGTCT 35 
(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:
GACTGGATCC CTGTGAGAAT CAAGCTACAC CCA 33

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 
CTGAAAGCTT TTGTAACTGA GATTGATCTG GGAG 34 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

Leu Ala Ser Gly Ala Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 

Leu Pro Xaa Thr Gly Xaa 
1 5 



